Abstract

The degrees-of-freedom of a K-user Gaussian interference channel (GIFC) has been defined
to be the multiple of $(1/2)\log_2 P$ at which the maximum sum of achievable rates grows with
increasing P. In this paper, we establish that the degrees-of-freedom of three or more user, real,
scalar GIFCs, viewed as a function of the channel coefficients, is discontinuous at points where
all of the coefficients are non-zero rational numbers. More specifically, for all K > 2, we find
a class of K-user GIFCs that is dense in the GIFC parameter space for which K/2 degrees-of-freedom
are exactly achievable, and we show that the degrees-of-freedom for any GIFC with
non-zero rational coefficients is strictly smaller than K/2. These results are proved using new
connections with number theory and additive combinatorics.

1 Introduction 

The time-invariant, real, scalar K-user Gaussian interference channel (GIFC), as introduced in [1],
involves K transmitter-receiver pairs in which each transmitter attempts to communicate a
uniformly distributed, finite-valued message to its corresponding receiver by sending a signal comprised
of n real numbers. Each receiver observes a component-wise linear combination of possibly all of the
transmitted signals plus additive memoryless Gaussian noise, and seeks to decode, with probability
close to one, the message of its corresponding transmitter, in spite of the interfering signals and
noise. The time averages of the squares of the transmitted signal values are required to not exceed
certain power constraints. A K-tuple of rates $(R_1, \ldots, R_K)$ is said to be achievable for a GIFC
if the transmitters can increase the sizes of their message sets as $2^{nR_i}$ with the signal length n,
and signal in such a way that the power constraints are met and the receivers are able to correctly
decode their corresponding messages with probability converging to 1, as n grows to infinity. The
set of all achievable K-tuples of rates is known as the capacity region of the GIFC. Determining
it, as a function of the channel coefficients (specifying the linear combinations mentioned above),
power constraints, and noise variances, has been an open problem in information theory for over
30 years.

A complete solution for even the two-user case, which has received the most attention to date, is still
out of reach. The best known coding scheme for two users is that presented in [2]. In some ranges
of channel coefficients, such as for strong interference, the capacity region is completely known for
two users [3]. Still for other ranges, the maximum achievable sum-of-rates is known [2, 4, 5, 6]. For
the general two-user case, the strongest known result is that of [7], which determines the capacity
region to within a 1/2 bit margin (1 bit for the complex case) using a carefully chosen version of
the scheme of [2], and a new genie-aided outer bound.

The case of K > 2 users has, until very recently, received less attention. Much of the recent
effort on K > 2, beginning with [8] and continuing in e.g., [9], [10], has focused on characterizing the
growth of the capacity region in the limit of increasing signal-to-noise ratio (SNR) corresponding, for
example, to fixing the noise variances and channel coefficients and letting the power constraints tend
to infinity. Specific attention has been directed at the growth of the maximum sum of achievable
rates. If there were no interference, the maximum achievable rate corresponding to each
transmitter-receiver pair would grow like $(1/2)\log_2 P$ in the limit of increasing power, which follows from the
well known formula of $(1/2)\log_2(1 + P/N)$ for the capacity of a single user additive Gaussian noise
channel with power constraint P and noise variance N. Thus, the maximum sum of achievable rates
would grow as $(K/2)\log_2 P$ if there were no interference. This motivates the expectation that, in
the general case, the maximum sum of achievable rates would grow as $(d/2)\log_2 P$ for some constant
$d \le K$, depending on the channel coefficients, where d has been dubbed the degrees-of-freedom
of the underlying GIFC. Although determining d for a given GIFC is, in principle, simpler than
determining the capacity region, it has turned out to be a difficult problem in its own right, for
K > 2.^1
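
As a small numerical illustration of the scaling argument above, the following sketch evaluates the single-user capacity formula and its ratio to $(1/2)\log_2 P$. The function names and the default noise variance N = 1 are illustrative assumptions, not part of the original analysis.

```python
import math

def awgn_capacity_bits(P: float, N: float = 1.0) -> float:
    """Single-user Gaussian channel capacity (1/2) log2(1 + P/N), in bits per channel use."""
    return 0.5 * math.log2(1.0 + P / N)

def dof_ratio(P: float, N: float = 1.0) -> float:
    """Ratio of the capacity to (1/2) log2 P; it tends to 1 as P grows."""
    return awgn_capacity_bits(P, N) / (0.5 * math.log2(P))

if __name__ == "__main__":
    for exponent in (2, 4, 8, 16):
        P = 10.0 ** exponent
        print(f"P = 1e{exponent}: ratio = {dof_ratio(P):.6f}")
```

The ratio approaching one is the single-user counterpart of the degrees-of-freedom normalization used in the rest of the paper.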

A positive development in the study of the degrees-of-freedom of GIFCs with more than two users 
has been the discovery of a new coding technique known as interference alignment, which involves 
carefully choosing the transmitted signals so that the interfering signals "align" benignly at each 
receiver [5]. Interference alignment has been shown, under some conditions which we summarize 
below, to achieve nearly d = K/2 degrees-of-freedom, which is half of the degrees-of-freedom in the 
case of no interference at all. Interference alignment is not possible to implement for two users and 
its discovery thus had to wait until the focus shifted to more users. Another new phenomenon in 
network information theory that has recently emerged as the number of users studied was increased, 
is the technique of indirect decoding, which is crucial for achieving the capacity region of certain 
three-user broadcast channels [12]. Again, this technique is not relevant in the two-user case, and 
could not have been discovered in the study thereof. 

In this paper, we find a new information theoretic phenomenon concerning interference channels
that is not manifest in the two-user case. In particular, we find that the degrees-of-freedom (and
therefore the capacity region at high signal-to-noise ratio) of real, scalar GIFCs with K > 2 users
is very sensitive to whether the channel coefficients determining the linear combinations of signals
at each receiver are rational or irrational numbers. Next, we formally explain our results and their
significance in the context of the growing literature on K > 2 user GIFCs.

1 The degrees-of-freedom is known to be 1 for all two-user GIFCs, unless there is no interference [8].

We shall use a matrix H to denote the direct and cross gains of a time invariant, real, scalar
K-user GIFC [1], with the (i,j)-th entry $h_{ij}$ specifying the channel gain from transmitter i to
receiver j. Thus, the signal observed by receiver $j \in \{1, \ldots, K\}$ at time index t = 1, 2, ... is given
by $z_{j,t} + \sum_{i=1}^{K} x_{i,t} h_{ij}$, where $x_{i,t}$ is the real valued signal of transmitter $i \in \{1, \ldots, K\}$ at time t
and $z_{j,t}$ is additive Gaussian noise with variance $\sigma_j^2$, independent across time and users. Fixing a
block length n, the transmitted signals $\{x_{i,t}\}$ are required to satisfy the average power constraints
$\sum_{t=1}^{n} x_{i,t}^2 \le n P_i$ for some collection of powers $P_1, \ldots, P_K$. For $H \in \mathbb{R}^{K \times K}$ and $\sigma, P \in \mathbb{R}_+^K$, we
let $\mathcal{C}(H, \sigma, P)$ denote the capacity region of a GIFC with gain matrix H, receiver noise variances
given by the corresponding components of $\sigma$, and average (per codeword) power constraints given
by the components of P. Following [8], we define the degrees-of-freedom of H as

\[ \mathrm{DoF}(H) = \limsup_{P \to \infty} \frac{\max_{R \in \mathcal{C}(H, \mathbf{1}, P\mathbf{1})} \mathbf{1}^t R}{\frac{1}{2}\log_2 P}, \tag{1} \]

where $\mathbf{1}$ denotes the vector of all ones. The degrees-of-freedom of a GIFC characterizes the behavior
of the maximum achievable sum rate as the SNR tends to infinity, with the gain matrix fixed.

A fully connected GIFC is one for which $h_{ij} \ne 0$ for all i and j. It was shown in [8] that for
fully connected H, $\mathrm{DoF}(H) \le K/2$. If H is not fully connected, the degrees-of-freedom can be
as high as K, such as when H is the identity matrix where all cross gains are zero. Little was
known about tightness of the K/2 bound for K > 2 until it was shown in [9] that for vector GIFCs
and an appropriate generalization of $\mathrm{DoF}(\cdot)$ to include a normalization by the input/output vector
dimension, the degrees-of-freedom of "almost all" fully connected vector GIFCs approaches K/2
when the vector dimension tends to infinity.^2 In addition, an example of a fully connected
two-dimensional vector GIFC achieving exactly K/2 degrees-of-freedom was also given in [9]. The key
tool introduced in [9] to establish these results is the technique of interference alignment, which
involves the transmitters signaling over linear subspaces that, after component-wise scaling by
the cross gains, align into interfering subspaces which are linearly independent with the directly
received subspaces, allowing for many interference free dimensions over which to communicate. For
real, scalar GIFCs, it was shown in [10], using a different type of interference alignment, that the
degrees-of-freedom of certain fully connected GIFCs also approaches K/2 when the cross gains tend
to zero. Yet a different type of interference alignment is used in [11] to find new achievable rates for
a non-fully connected GIFC in which interference occurs only at one receiver. To our knowledge,
the problem of determining or computing the degrees-of-freedom of general GIFCs is still open.

As in [10], in this paper we consider only fully connected scalar, real GIFCs and establish the
following results on the degrees-of-freedom.



2 The K/2 bound of [8] extends to the fully connected vector case as well. 
Theorem 1 If all diagonal components of a fully connected H are irrational algebraic numbers
and all off-diagonal components are rational numbers then DoF(H) = K/2.

Theorem 2 For K > 2, if all elements of a fully connected H are rational numbers then DoF(H) <
K/2.

The following corollary is then immediate from Theorems 1 and 2 and the well known fact that
irrational algebraic numbers are dense in the real numbers.

Corollary 1 For K > 2, the function DoF(H) is discontinuous at all fully connected H with
rational components.

Theorem 1 demonstrates the existence of fully connected, real K-user GIFCs with exactly K/2
degrees-of-freedom. In contrast to the result of [10], Theorem 1 is non-asymptotic (in H). The
underlying achievability scheme is based on an interference alignment phenomenon that differs from
the ones used in [9] and [10], and relies on number theoretic lower bounds on the approximability
of irrational algebraic numbers by rationals.

Theorem 2 reveals a surprising limitation on DoF(H) when the components are non-zero rational
numbers (up to arbitrary pre-post multiplication by diagonal matrices - see Lemma 1 in Section 3).
In this case, DoF(H) is strictly bounded away from K/2. Previously known techniques for finding
outer-bounds to the capacity regions of GIFCs, such as cooperative encoding and decoding [HUH], 
genie aided decoding [151 (EL an d multiple access bounds [HE] are not sensitive to the rationality 
of the channel parameters and hence do not suffice to establish Theorem 2. Instead, our proof of
this theorem is based on a new connection between GIFCs with rational H and results from additive 
combinatorics [2D], a branch of combinatorics that is concerned with the cardinalities of sum sets, 
or sets obtained by adding (assuming an underlying group structure) any element of a set A to any 
element of a set B. 

The remainder of the paper is organized as follows. The next section clarifies some notation and
gives the formal definition of the capacity region of a GIFC that will apply in this paper. In
Section 3, we present the proof of Theorem 1. This is followed by the proof of Theorem 2 in
Section 4, which further consists of subsections collecting various intermediate results. Each of
these sections is prefaced with a high level outline of the respective proofs. In Section 5, we
determine lower and upper bounds on DoF(H) for a simple three-user rational H by improving on
the scheme of [10], and evaluating an upper bound implicit in the proof of Theorem 2. We conclude
in Section 6 with some final observations and directions for future work.
2 Notation and definitions



We adopt the usual notation for the information theoretic quantities of discrete and differential
entropy (resp. H(X) and h(X)), and mutual information (I(X;Y)), which shall all be measured in
bits (i.e. involve logarithms to the base two) [18]. We shall use the standard notation $\lceil x \rceil$ and $\lfloor x \rfloor$
to respectively denote the smallest integer not smaller than x and the greatest integer not larger
than x. The cardinality of a set A shall be denoted as |A|.

Next, we review the definition of the capacity region of a K-user GIFC with power constraints
$P = (P_1, \ldots, P_K)$ and noise variances $\sigma = (\sigma_1^2, \ldots, \sigma_K^2)$ that will apply in this paper. Fixing a
block length n and a rate-tuple $R_1, \ldots, R_K$, the random message $W_i$ of the i-th transmitter is
assumed to be uniformly distributed in the set $\mathcal{W}_i = \{1, \ldots, 2^{\lceil nR_i \rceil}\}$. The messages are further
assumed to be independent from one user to the next. A coding scheme consists of K encoding
functions $f_1, \ldots, f_K$ where $f_i$ maps $\mathcal{W}_i$ into the n-dimensional ball of radius $\sqrt{nP_i}$ of real vectors,
the components of which specify the signal value $x_{i,t}$ that the i-th transmitter will send at each
time index.^3 The set $\{f_i(1), f_i(2), \ldots, f_i(2^{\lceil nR_i \rceil})\}$ constitutes the codebook of transmitter i. There
is also a corresponding set of K decoding functions $g_1, \ldots, g_K$ where $g_i$ maps n-dimensional real
vectors into the message set $\mathcal{W}_i$. The function $g_i$ is applied by receiver i to the n received signal
values $y_i^n = (y_{i,1}, \ldots, y_{i,n})$, which, as specified in the introduction, are formed as a component-wise
linear combination, according to the gain matrix H, of the transmitted signals and Gaussian noise.

Definition 1 The capacity region $\mathcal{C}(H, \sigma, P)$ of the GIFC is defined as the set of rate-tuples
$R_1, \ldots, R_K$ for which there exists a sequence of block length n message sets and power-constrained
coding schemes satisfying $\lim_{n \to \infty} \max_{1 \le i \le K} \Pr(W_i \ne g_i(y_i^n)) = 0$, where the probability of error
is taken with respect to the distribution induced by the random messages, the coding scheme, and
the channel, as specified above.

The degrees-of-freedom DoF(H) of a GIFC, as defined in (1) above, will be assumed to be based
on this formal definition of $\mathcal{C}(H, \sigma, P)$.

3 Real, scalar GIFCs with exactly K/2 degrees-of-freedom 

In this section, we prove Theorem 1, demonstrating the existence of fully connected, real, scalar
K-user GIFCs with exactly K/2 degrees-of-freedom. An outline of the proof is as follows. First,
we prove a simple lemma (which will also be useful in the next section) showing that $\mathrm{DoF}(H) =
\mathrm{DoF}(D_t H D_r)$ for any diagonal matrices $D_t$ and $D_r$ with positive diagonal components. This, in
turn, implies that we can transform any H satisfying the assumptions of Theorem 1 to one with
irrational, algebraic numbers along the diagonal and integer values in off-diagonal components,
while preserving the degrees-of-freedom. We then focus on coding schemes for the new H in which
each transmitter is restricted to signaling over the scalar lattice $\{zP^{1/4+\epsilon} : z \in \mathbb{Z}\}$ intersected
with the interval $[-P^{1/2}, P^{1/2}]$. The idea is that the integer valued cross gains guarantee that the
interfering signal values at each receiver will also be confined to this scalar lattice (though may fall
outside of the $P^{1/2}$ interval), while the irrational direct gains place the directly transmitted signal
values on a scaled lattice that "stands out" from the interfering lattice. Specifically, this scaled
lattice has the property that offsetting the interfering lattice (equal to the original lattice) by each
point in the scaled lattice results in disjoint sets. A non-empty intersection would imply that the
direct gain could be written as the ratio of two integers, which would contradict its irrationality.
An even stronger property holds for algebraic irrational direct gains: the distance between any pair
of points obtained by adding a point from the scaled lattice to a point from the interfering lattice
actually grows with P. This is shown to follow from a major result in number theory stating that
for any irrational algebraic number $\alpha$ and any $\gamma > 0$, a rational p/q approximation will have an
error of at least $\delta/q^{2+\gamma}$ for some $\delta$ depending only on $\alpha$ and $\gamma$.^4 The next step in the proof is to
deal with the noise by coupling this inter-point distance growth with Fano's inequality to show that
the mutual information induced between each transmitter-receiver pair by independent, uniform
distributions on the original power-constrained lattices, taking interference into account, grows like
$(1/4 - \epsilon)\log_2 P$, for arbitrarily small $\epsilon$. This, in turn, implies the existence of a sequence of block
codes (with symbols from the original lattice) with sum rate approaching $(K/4)\log_2 P$, and which
are correctly decodable, with high probability, by treating interference as noise.

3 For simplicity, in this paper, we formally adopt a per codeword average power constraint, as opposed to the more
conventional expected average power constraint. We note that the degrees-of-freedom of a GIFC can be shown to be
the same under both types of power constraints.

As mentioned, we begin with an invariance property of DoF(H). 

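Lemma 1 (Invariance property) For any matrix H and diagonal matrices $D_t$ and $D_r$ with
positive diagonal components, $\mathrm{DoF}(D_t H D_r) = \mathrm{DoF}(H)$.^5

Proof: Let $D_t = \mathrm{diag}(d_{t1}, \ldots, d_{tK})$ and $D_r = \mathrm{diag}(d_{r1}, \ldots, d_{rK})$. In the matrix multiplication $D_t H D_r$,
$d_{ti}$ scales the channel gains from transmitter i to the different receivers, while $d_{rj}$ scales the channel
gains from all transmitters to receiver j. By scaling the input signals and noise variances instead
of the channel gains, we can write

\[ \mathcal{C}(D_t H D_r, \mathbf{1}, P\mathbf{1}) = \mathcal{C}(H, \mathbf{1}^t D_r^{-2}, P\mathbf{1}^t D_t^{2}). \tag{2} \]

Let $\underline{d}_t = \min_{1 \le i \le K} d_{ti}$, $\bar{d}_t = \max_{1 \le i \le K} d_{ti}$, $\underline{d}_r = \min_{1 \le j \le K} d_{rj}$, $\bar{d}_r = \max_{1 \le j \le K} d_{rj}$. Since increasing
the power constraints and reducing the noise variances cannot reduce the capacity region of the
GIFC we have

\[ \mathcal{C}(H, (1/\underline{d}_r^2)\mathbf{1}, P\underline{d}_t^2\mathbf{1}) \subseteq \mathcal{C}(H, \mathbf{1}^t D_r^{-2}, P\mathbf{1}^t D_t^{2}) \subseteq \mathcal{C}(H, (1/\bar{d}_r^2)\mathbf{1}, P\bar{d}_t^2\mathbf{1}). \tag{3} \]

Furthermore, once all the noise variances are equal, they can be normalized to 1 by scaling the
power constraints, leading to

\[ \begin{aligned}
\mathcal{C}(H, (1/\underline{d}_r^2)\mathbf{1}, P\underline{d}_t^2\mathbf{1}) &= \mathcal{C}(H, \mathbf{1}, P\underline{d}_r^2\underline{d}_t^2\mathbf{1}), \\
\mathcal{C}(H, (1/\bar{d}_r^2)\mathbf{1}, P\bar{d}_t^2\mathbf{1}) &= \mathcal{C}(H, \mathbf{1}, P\bar{d}_r^2\bar{d}_t^2\mathbf{1}).
\end{aligned} \tag{4} \]

Using (2), (3) and (4) we can write

\[ \frac{\max_{R \in \mathcal{C}(H, \mathbf{1}, P\underline{d}_r^2\underline{d}_t^2\mathbf{1})} \mathbf{1}^t R}{\frac{1}{2}\log_2 P}
\le \frac{\max_{R \in \mathcal{C}(D_t H D_r, \mathbf{1}, P\mathbf{1})} \mathbf{1}^t R}{\frac{1}{2}\log_2 P}
\le \frac{\max_{R \in \mathcal{C}(H, \mathbf{1}, P\bar{d}_r^2\bar{d}_t^2\mathbf{1})} \mathbf{1}^t R}{\frac{1}{2}\log_2 P}, \]

which can be rewritten as

\[ \frac{\frac{1}{2}\log_2(P\underline{d}_r^2\underline{d}_t^2)}{\frac{1}{2}\log_2 P} \cdot \frac{\max_{R \in \mathcal{C}(H, \mathbf{1}, P\underline{d}_r^2\underline{d}_t^2\mathbf{1})} \mathbf{1}^t R}{\frac{1}{2}\log_2(P\underline{d}_r^2\underline{d}_t^2)}
\le \frac{\max_{R \in \mathcal{C}(D_t H D_r, \mathbf{1}, P\mathbf{1})} \mathbf{1}^t R}{\frac{1}{2}\log_2 P}
\le \frac{\frac{1}{2}\log_2(P\bar{d}_r^2\bar{d}_t^2)}{\frac{1}{2}\log_2 P} \cdot \frac{\max_{R \in \mathcal{C}(H, \mathbf{1}, P\bar{d}_r^2\bar{d}_t^2\mathbf{1})} \mathbf{1}^t R}{\frac{1}{2}\log_2(P\bar{d}_r^2\bar{d}_t^2)}. \]

Taking limsup of all three terms as $P \to \infty$ implies $\mathrm{DoF}(H) \le \mathrm{DoF}(D_t H D_r) \le \mathrm{DoF}(H)$. ■

4 For irrational algebraic numbers of degree two (solutions to quadratic equations with integer coefficients), such
as $\sqrt{2}$, the approximation bound holds with $\gamma = 0$ and is known as Liouville's Theorem (established in 1844). The
validity of the bound for general algebraic numbers was a longstanding open problem in number theory and was
finally established in 1955 by K. F. Roth, for which he was awarded the Fields Medal.

5 The matrices $D_t$ and $D_r$ need only have non-zero diagonal components for the result to hold. We assume
positivity for simplicity, as this is all we shall require in this paper.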

Proof: (of Theorem 1) By Lemma 1 we can scale H (by post multiplying by an integer valued $D_r$)
so that all off-diagonal elements are integers and all diagonal elements remain irrational algebraic.
In addition, from (1) we only need to consider channels where all the inputs have the same power
constraint P and all the noise processes have variance 1.
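
The integer scaling step can be sketched in code. The example below is a minimal illustration under stated assumptions: the helper names are hypothetical, off-diagonal gains are given as exact `Fraction` values, the irrational diagonal gains are represented by floating-point stand-ins, and `math.lcm` requires Python 3.9 or later. Post-multiplying by a diagonal matrix scales column j, so each column is scaled by the least common multiple of the denominators of its off-diagonal entries.

```python
from fractions import Fraction
from math import lcm

def integer_column_scalings(H):
    """Return positive integers d_r[j] such that every rational (Fraction)
    off-diagonal entry of column j of H becomes an integer after scaling
    column j by d_r[j]. Diagonal entries are skipped: in Theorem 1 they are
    irrational algebraic and remain so under scaling by a non-zero integer."""
    K = len(H)
    d_r = []
    for j in range(K):
        denominators = [H[i][j].denominator for i in range(K) if i != j]
        d_r.append(lcm(*denominators))
    return d_r

def scale_columns(H, d_r):
    """Compute H * diag(d_r), i.e. scale column j of H by d_r[j]."""
    K = len(H)
    return [[H[i][j] * d_r[j] for j in range(K)] for i in range(K)]

if __name__ == "__main__":
    s2 = 2 ** 0.5  # floating-point stand-in for the irrational algebraic gain sqrt(2)
    H = [
        [s2, Fraction(1, 2), Fraction(3, 4)],
        [Fraction(2, 3), s2, Fraction(1, 6)],
        [Fraction(5, 2), Fraction(1, 3), s2],
    ]
    d_r = integer_column_scalings(H)
    print(d_r)
    print(scale_columns(H, d_r))
```

By Lemma 1 the scaled matrix has the same degrees-of-freedom as the original one, which is why the proof may assume integer cross gains.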

For any $\epsilon > 0$, we will present a communication scheme that achieves $\mathbf{1}^t R = (K/4 - K\epsilon)\log_2 P -
o(\log_2 P)$, implying that $\mathrm{DoF}(H) \ge K/2$. Consider the scalar lattice

\[ \Lambda_{P,\epsilon} = \{x : x = P^{1/4+\epsilon} z, \; z \in \mathbb{Z}\} \]

and let $\mathcal{C}_{P,\epsilon} = \Lambda_{P,\epsilon} \cap [-\sqrt{P}, \sqrt{P}]$. Note that

\[ |\mathcal{C}_{P,\epsilon}| = 2\left\lfloor \frac{\sqrt{P}}{P^{1/4+\epsilon}} \right\rfloor + 1 \le 2P^{1/4-\epsilon} + 1. \tag{5} \]

The users communicate using codebooks of block length n, obtained by uniform i.i.d. sampling of
$\mathcal{C}_{P,\epsilon}$. Note that due to the truncation of the lattice to the interval $[-\sqrt{P}, \sqrt{P}]$, the symbol power
$(x_{i,t}^2)$ never exceeds P at any time index, and hence the average codeword power does not exceed
P. Each receiver decodes the signal of its transmitter, treating the interfering signals as i.i.d. noise.
With this scheme, as $n \to \infty$ we can achieve:

\[ R_i = I(X_i; Y_i) = H(X_i) - H(X_i | Y_i), \quad i = 1, \ldots, K, \]

where $X_i \sim \mathrm{Uniform}(\mathcal{C}_{P,\epsilon})$, $Y_i = \sum_{j=1}^{K} h_{ji} X_j + Z_i$, and $Z_i \sim \mathcal{N}(0, 1)$, $i = 1, \ldots, K$.
First we note that $H(X_i) = \log_2 |\mathcal{C}_{P,\epsilon}| \approx (\frac{1}{4} - \epsilon)\log_2 P + \log_2 2$. We will show that

\[ \limsup_{P \to \infty} H(X_i | Y_i) \le 1, \quad i = 1, \ldots, K, \]

and as a result, $R_i = (\frac{1}{4} - \epsilon)\log_2 P - o(\log_2 P)$ can be achieved. It would then follow that $\mathrm{DoF}(H) \ge
K/2$, and, from the upper bound of [8], that DoF(H) = K/2.
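
The truncated lattice codebook described above can be built explicitly. The following is a minimal sketch; the function name and the numerical values of P and the small parameter (called `eps` here) are illustrative assumptions. It checks the size bound of the form 2 P^(1/4 - eps) + 1 and the per-symbol power constraint.

```python
import math

def lattice_codebook(P: float, eps: float):
    """Points of the scalar lattice {P^(1/4+eps) * z : z integer}
    that lie in the interval [-sqrt(P), sqrt(P)]."""
    spacing = P ** (0.25 + eps)
    z_max = math.floor(math.sqrt(P) / spacing)
    return [spacing * z for z in range(-z_max, z_max + 1)]

if __name__ == "__main__":
    P, eps = 1e8, 0.05
    C = lattice_codebook(P, eps)
    size_bound = 2 * P ** (0.25 - eps) + 1
    print("codebook size:", len(C), "bound:", size_bound)
    # Entropy of a uniform symbol versus the (1/4 - eps) log2 P + log2 2 approximation.
    print(math.log2(len(C)), (0.25 - eps) * math.log2(P) + 1)
    print("peak symbol power within P:", max(x * x for x in C) <= P)
```

Because every symbol already has power at most P, any codeword drawn from this alphabet meets the per-codeword average power constraint.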

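We will use the following lemma to upper bound $H(X_i | Y_i)$ for $i = 1, \ldots, K$.

Lemma 2 Let $\mathcal{E}_{P,\epsilon} = \{\alpha x + s : x \in \mathcal{C}_{P,\epsilon}, s \in \Lambda_{P,\epsilon}\}$, with $\alpha$ being any real, irrational, and algebraic
number. For any $y \in \mathcal{E}_{P,\epsilon}$ there exists a unique pair $(x, s) \in \mathcal{C}_{P,\epsilon} \times \Lambda_{P,\epsilon}$ such that $y = \alpha x + s$. In
addition, if $y_1, y_2 \in \mathcal{E}_{P,\epsilon}$, $y_1 \ne y_2$, then $|y_1 - y_2| > P^{\epsilon}$ for any given $\epsilon > 0$ and large enough P.

Proof: Let $y = \alpha x + s$ with $x \in \mathcal{C}_{P,\epsilon}$, $s \in \Lambda_{P,\epsilon}$. To get a contradiction, assume that there
exists $(\tilde{x}, \tilde{s}) \in \mathcal{C}_{P,\epsilon} \times \Lambda_{P,\epsilon}$ with $(\tilde{x}, \tilde{s}) \ne (x, s)$ such that $\alpha\tilde{x} + \tilde{s} = y$. Without loss of generality
we can assume $\tilde{x} \ge x$. Since $\alpha \ne 0$ we have $\tilde{s} \ne s$ and $\tilde{x} > x$. In addition, since by assumption
$\alpha x + s = \alpha\tilde{x} + \tilde{s}$, we have

\[ \alpha = \frac{s - \tilde{s}}{\tilde{x} - x} = \frac{(z_s - z_{\tilde{s}})P^{1/4+\epsilon}}{(z_{\tilde{x}} - z_x)P^{1/4+\epsilon}} = \frac{z_s - z_{\tilde{s}}}{z_{\tilde{x}} - z_x}, \]

where $z_x, z_{\tilde{x}}, z_s, z_{\tilde{s}} \in \mathbb{Z}$, which contradicts the assumption of irrational $\alpha$.

To prove the second part of the lemma, let $\tilde{y} = \alpha\tilde{x} + \tilde{s}$, where $\tilde{x} \in \mathcal{C}_{P,\epsilon}$, $\tilde{s} \in \Lambda_{P,\epsilon}$, and $\tilde{y} \in \mathcal{E}_{P,\epsilon}$, with
$\tilde{y} \ne y$. If $s = \tilde{s}$ then

\[ |y - \tilde{y}| = |\alpha| |x - \tilde{x}| = |\alpha| |z_x - z_{\tilde{x}}| P^{1/4+\epsilon} > P^{\epsilon}, \]

where $z_x, z_{\tilde{x}} \in \mathbb{Z}$, as long as P is sufficiently large. Similarly, if $x = \tilde{x}$ and P is large enough we
have

\[ |y - \tilde{y}| = |s - \tilde{s}| = |z_s - z_{\tilde{s}}| P^{1/4+\epsilon} > P^{\epsilon}, \]

where $z_s, z_{\tilde{s}} \in \mathbb{Z}$. So it remains to consider the case $x \ne \tilde{x}$ and $s \ne \tilde{s}$. Without loss of generality
we can assume $\tilde{x} > x$. To get a contradiction, we assume that $|y - \tilde{y}| \le P^{\epsilon}$, and write:

\[ \begin{aligned}
|y - \tilde{y}| &\le P^{\epsilon} \\
|\alpha x + s - \alpha\tilde{x} - \tilde{s}| &\le P^{\epsilon} \\
|\alpha z_x + z_s - \alpha z_{\tilde{x}} - z_{\tilde{s}}| &\le \frac{P^{\epsilon}}{P^{1/4+\epsilon}} = P^{-1/4} \\
\left| \alpha - \frac{z_s - z_{\tilde{s}}}{z_{\tilde{x}} - z_x} \right| &\le \frac{P^{-1/4}}{z_{\tilde{x}} - z_x},
\end{aligned} \tag{6} \]

where $z_x, z_{\tilde{x}}, z_s, z_{\tilde{s}} \in \mathbb{Z}$.

On the other hand there are bounds on how well an irrational algebraic number can be approximated
with a rational number. The most refined of those bounds, due to Roth, 1955, states that for any
irrational algebraic $\alpha$, and any $\gamma > 0$, there exists $\delta > 0$ such that

\[ \left| \alpha - \frac{p}{q} \right| > \frac{\delta}{q^{2+\gamma}} \tag{7} \]

for all $p, q \in \mathbb{Z}$, $q > 0$.
Combining (6) and (7) we have

\[ \frac{\delta}{(z_{\tilde{x}} - z_x)^{2+\gamma}} < \left| \alpha - \frac{z_s - z_{\tilde{s}}}{z_{\tilde{x}} - z_x} \right| \le \frac{P^{-1/4}}{z_{\tilde{x}} - z_x}, \]

so that

\[ \delta < P^{-1/4} (z_{\tilde{x}} - z_x)^{1+\gamma} \overset{(a)}{\le} P^{-1/4} \left( 2P^{1/4-\epsilon} + 1 \right)^{1+\gamma} = 2^{1+\gamma} P^{\gamma/4 - \epsilon(1+\gamma)} + o(1), \tag{8} \]

where we used (5) in step (a). But the right hand side of (8) goes to 0 as $P \to \infty$ whenever $\epsilon \ge 1/4$
or $\gamma < \epsilon/(1/4 - \epsilon)$. Since we can choose any $\gamma > 0$, we can obtain a contradiction in (8) for any
$\epsilon > 0$, for large enough P. ■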
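
The separation property of Lemma 2 can be checked numerically for a specific gain. The sketch below is an illustration only, under stated assumptions: the irrational algebraic gain is taken to be sqrt(2) in floating point, the function name is hypothetical, and the values of P and the small parameter `eps` are arbitrary. It uses the fact that the gap between two points of the combined set equals the lattice spacing times |alpha * dz_x + dz_s| for integer index differences dz_x and dz_s.

```python
import math

def min_distance(alpha: float, P: float, eps: float) -> float:
    """Smallest gap between distinct points alpha*x + s, with x in the truncated
    lattice (codebook) and s in the full lattice of spacing P^(1/4+eps).

    With x = spacing*z_x and s = spacing*z_s, a gap equals
    spacing * |alpha*dz_x + dz_s|, where dz_x is a difference of codebook
    indices and dz_s is any integer."""
    spacing = P ** (0.25 + eps)
    z_max = math.floor(math.sqrt(P) / spacing)
    best = 1.0  # dz_x = 0 forces |dz_s| >= 1
    for dz_x in range(1, 2 * z_max + 1):
        t = abs(alpha) * dz_x
        best = min(best, abs(t - round(t)))  # best integer dz_s for this dz_x
    return spacing * best

if __name__ == "__main__":
    P, eps = 1e10, 0.1
    d = min_distance(math.sqrt(2), P, eps)
    print("minimum distance:", d, "threshold P^eps:", P ** eps)
```

For these values the minimum gap exceeds the threshold. For small P the inequality can fail, which is consistent with the lemma holding only for large enough P.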

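We will use Lemma 2 to build an estimator that can identify $X_i$ with high probability.

Let $S_i = \sum_{j \ne i} h_{ji} X_j$, and note that since $h_{ji} \in \mathbb{Z}$ for $j \ne i$ we have that $S_i \in \Lambda_{P,\epsilon}$. In addition, let
$\mathcal{E}_{P,\epsilon,i} = \{h_{ii} x + y : x \in \mathcal{C}_{P,\epsilon}, y \in \Lambda_{P,\epsilon}\}$, and let $v_i : \mathcal{C}_{P,\epsilon} \times \Lambda_{P,\epsilon} \to \mathcal{E}_{P,\epsilon,i}$ be defined as $v_i(x, s) = h_{ii} x + s$.
Using Lemma 2 and the fact that $h_{ii}$ is real, algebraic and irrational we have that $v_i$ is invertible,
i.e. there exists $v_i^{-1} : \mathcal{E}_{P,\epsilon,i} \to \mathcal{C}_{P,\epsilon} \times \Lambda_{P,\epsilon}$ such that $v_i^{-1}(v_i(x, s)) = (x, s)$ for any $(x, s) \in \mathcal{C}_{P,\epsilon} \times \Lambda_{P,\epsilon}$.

Let $u : \mathbb{R}^2 \to \mathbb{R}$ be defined as $u(x, s) = x$, and let $\hat{X}_i = u(v_i^{-1}(\arg\min_{r \in \mathcal{E}_{P,\epsilon,i}} |r - Y_i|))$. We have
$\hat{X}_i \ne X_i$ whenever $Y_i$ is closer to some other point in $\mathcal{E}_{P,\epsilon,i}$ than it is to $h_{ii} X_i + S_i$. From Lemma
2 this can only occur if $|Z_i| > P^{\epsilon}/2$ for large enough P. It follows that

\[ \Pr(\hat{X}_i \ne X_i) \le \Pr\left( |Z_i| > \frac{P^{\epsilon}}{2} \right) = 2 Q_{\mathcal{N}(0,1)}\left( \frac{P^{\epsilon}}{2} \right) \le 2\exp\left( -\frac{P^{2\epsilon}}{8} \right), \]

where $Q_{\mathcal{N}(0,1)}(x)$ is the probability that a Gaussian random variable with zero mean and variance
one exceeds x. Using the data processing and Fano's inequalities we obtain

\[ \begin{aligned}
H(X_i | Y_i) &\le H(X_i | \hat{X}_i) \\
&\le 1 + \Pr(\hat{X}_i \ne X_i) \log_2(|\mathcal{C}_{P,\epsilon}|) \\
&\le 1 + 2\exp\left( -\frac{P^{2\epsilon}}{8} \right) \left[ \left( \frac{1}{4} - \epsilon \right) \log_2 P + \log_2 2 + o(1) \right],
\end{aligned} \tag{9} \]

which goes to 1 as $P \to \infty$.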
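
To see how quickly the Fano-type bound approaches 1, the following sketch evaluates it for a few power levels. It is a minimal illustration under stated assumptions: it uses the standard Gaussian tail bound Q(x) <= exp(-x^2/2) at x = P^eps / 2, the codebook size bound 2 P^(1/4 - eps) + 1, and hypothetical function names.

```python
import math

def error_probability_bound(P: float, eps: float) -> float:
    """Upper bound 2*exp(-P^(2 eps)/8) on Pr(|Z| > P^eps / 2) for Z ~ N(0,1),
    obtained from the Gaussian tail bound Q(x) <= exp(-x^2 / 2)."""
    return 2.0 * math.exp(-(P ** (2 * eps)) / 8.0)

def conditional_entropy_bound(P: float, eps: float) -> float:
    """Fano-type bound 1 + Pr(error) * log2(2 P^(1/4 - eps) + 1), in bits."""
    codebook_size_bound = 2.0 * P ** (0.25 - eps) + 1.0
    return 1.0 + error_probability_bound(P, eps) * math.log2(codebook_size_bound)

if __name__ == "__main__":
    eps = 0.1
    for exponent in (2, 6, 12, 20):
        P = 10.0 ** exponent
        print(f"P = 1e{exponent}: bound = {conditional_entropy_bound(P, eps):.6f}")
```

The doubly exponential decay of the error term in log P dominates the logarithmic growth of the codebook entropy, so the bound settles at 1 bit.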



4 Degrees-of-freedom for rational H 

In this section, we give the proof of Theorem 2, establishing that the degrees-of-freedom of any
fully connected, real, scalar GIFC with rational H is bounded strictly below K/2, for K > 2. As in the previous
section, we begin with a sketch of the proof.
Most of the work in the proof is to establish the theorem for K = 3 users. The theorem for K > 3
will then follow from an extension of the averaging argument of [8], used therein to obtain the K/2
degrees-of-freedom upper bound from a bound of 1 on the degrees-of-freedom for K = 2. In this
case, the K = 3 bound is averaged over all three-tuples of users (transmitters and corresponding
receivers), as opposed to pairs of users in [8].

Given a K = 3 user GIFC with fully connected, rational H, using Lemma 1 (invariance property)
and eliminating cross links, we can upper bound DoF(H) by $\mathrm{DoF}(\tilde{H})$ where

\[ \tilde{H} = [\tilde{h}_{ij}] = \begin{bmatrix} 1 & 0 & 0 \\ 1 & p & 0 \\ 1 & q & 1 \end{bmatrix}, \]

where p and q are integers (see Figure 2 in Subsection 4.3). The main step in the proof of the
overall theorem is establishing that $\mathrm{DoF}(\tilde{H}) < 3/2$, which is formally carried out in Lemma 11
below, and proceeds as follows. First, it is shown (Lemma 4) that a deterministic channel obtained
by eliminating all noise sources and restricting the power-constrained codewords to have integer
valued symbols results in at most a power-constraint-independent loss in the achievable sum rate.
Therefore, the degrees-of-freedom (according to the obvious generalization) of this deterministic
interference channel (IFC) is no smaller than $\mathrm{DoF}(\tilde{H})$. Next, it is shown using a Fano's inequality
based argument that if the degrees-of-freedom of the deterministic IFC is at least 3/2 there would
exist finite sets of n-dimensional integer valued vectors $\mathcal{X}_2$ and $\mathcal{X}_3$ such that the corresponding
independent random variables $x_2^n$ and $x_3^n$, uniformly distributed on these sets, induce discrete
entropies satisfying $H(x_2^n) \approx n(1/4)\log_2 P$, $H(x_3^n) \approx n(1/4)\log_2 P$, $H(x_2^n + x_3^n) \approx n((1/4) +
\epsilon)\log_2 P$, and $H(p \cdot x_2^n + q \cdot x_3^n) \approx n((1/2) - \epsilon)\log_2 P$, for the integers p and q defining the channel
and $\epsilon$ arbitrarily small. These entropy relations suggest that the cardinality of the support of
$p \cdot x_2^n + q \cdot x_3^n$ is much larger than that of $x_2^n + x_3^n$. However, tools from additive combinatorics
(through Lemma 7) can be used to show that this is impossible for integer valued p and q, leading to
a contradiction, and thereby implying that the deterministic channel must have degrees-of-freedom
strictly smaller than 3/2. Unfortunately, the link between the entropy and the cardinality of the
support of a sum of independent, uniformly distributed random variables is sufficiently weak that a
somewhat more involved argument (incorporating Lemma 10 and Theorem 3) is ultimately required
to reach the above conclusions. The overall intuition behind the proof, however, is as outlined.

The rest of the section is organized as follows. Subsections 4.1 and 4.2 respectively collect supporting
results of an information theoretic nature and results from additive combinatorics. The proof of
the main lemma on the K = 3 user IFC is presented in Subsection 4.3. Finally, the extension to
K > 3 is presented in Subsection 4.4. Throughout, the capacity region of a K-user GIFC will be
taken as in Definition 1.
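
The additive-combinatorics intuition in the proof sketch of this section can be illustrated on a toy example. The code below is only a sketch with hypothetical names, not the argument of the paper: it compares the size of a plain sumset with that of an integer-dilated sumset for an arithmetic progression. With fixed integers p and q the dilated sumset grows at most linearly in the set size, just like the plain sumset, whereas a quadratic blow-up needs a multiplier that grows with the set.

```python
def sumset(A, B, p=1, q=1):
    """Return the set {p*a + q*b : a in A, b in B}."""
    return {p * a + q * b for a in A for b in B}

if __name__ == "__main__":
    N, p, q = 50, 2, 3
    A = range(N)
    plain = sumset(A, A)            # A + A
    dilated = sumset(A, A, p, q)    # p*A + q*A with fixed small integers
    growing = sumset(A, A, 1, N)    # multiplier q = N grows with the set size
    print("|A + A|        =", len(plain))
    print("|pA + qA|      =", len(dilated), "<=", (p + q) * (N - 1) + 1)
    print("|A + N*A|      =", len(growing))
```

In the channel model p and q are fixed by the gain matrix, which is the regime in which the dilated sumset cannot be much larger than the plain one.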
4.1 Supporting information theoretic results

Lemma 3 The capacity region of a K-user memoryless IFC,^6 where the codebook of user i is
subject to an average power constraint $P_i$, is given by the limiting expression:

\[ \mathcal{C}_{IFC} = \bigcup_{n=1}^{\infty} \; \bigcup_{\Pr(\|x_i^n\|^2 \le nP_i) = 1, \; i = 1, \ldots, K} \left\{ R \in \mathbb{R}^K : R_i \le \frac{1}{n} I(x_i^n; y_i^n), \; i = 1, \ldots, K \right\}. \tag{10} \]

Proof: The lemma can be proved by extending the argument of [23] to K-user memoryless IFCs
with possibly continuous alphabets and average power constraints on the inputs. The details are
omitted. ■



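Lemma 4 Given a gain matrix H and power constraints $P = (P_1, \ldots, P_K)$, let $\mathcal{C}_D(H, P)$ denote
the capacity region of the deterministic IFC defined by

\[ \tilde{y}_i(t) = \sum_{j=1}^{K} h_{ji} \tilde{x}_j(t), \quad i = 1, \ldots, K, \]

where the inputs are constrained to be integers (i.e. $\tilde{x}_i(t) \in \mathbb{Z}$, $i = 1, \ldots, K$, $t = 1, 2, \ldots$) and satisfy
an average power constraint $\frac{1}{n}\sum_{t=1}^{n} \tilde{x}_i(t)^2 \le P_i$ for all i.

Then, $R \in \mathcal{C}(H, \mathbf{1}, P) \Rightarrow (R - \Delta) \in \mathcal{C}_D(H, P)$ with $\Delta = (\delta_1, \ldots, \delta_K)$, $\delta_i = \frac{1}{2}\log_2(1 + 2\sum_{j=1}^{K} h_{ji}^2)$,
$i = 1, \ldots, K$, where $\mathcal{C}(H, \mathbf{1}, P)$ is the capacity region of the corresponding GIFC (see Definition 1).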



Proof: Let $f : \mathbb{R} \to \mathbb{R}$ be defined as $f(x) = \lfloor x \rfloor \cdot 1(x \ge 0) + \lceil x \rceil \cdot 1(x < 0)$, and let $g : \mathbb{R} \to \mathbb{R}$
be defined as $g(x) = x - f(x)$. In addition, let $x_{i1} = f(x_i)$, $x_{i2} = g(x_i)$, $y_{i1} = \sum_{j=1}^{K} h_{ji} x_{j1} + z_{i1}$,
and $y_{i2} = \sum_{j=1}^{K} h_{ji} x_{j2} + z_{i2}$, where $z_{i1}, z_{i2} \sim \mathcal{N}(0, 1/2)$ are independent. Then the outputs of the
K-user Gaussian IFC can be written as $y_i = y_{i1} + y_{i2}$, for $i = 1, \ldots, K$ (see Figure 1).
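
The integer/fractional decomposition used in this proof can be sketched as follows. This is a minimal illustration with hypothetical function names: it implements truncation toward zero, the residual part, and the per-receiver rate loss of the form (1/2) log2(1 + 2 * sum of squared gains) stated in Lemma 4.

```python
import math

def integer_part(x: float) -> int:
    """f(x): floor for x >= 0 and ceiling for x < 0, i.e. truncation toward zero."""
    return math.floor(x) if x >= 0 else math.ceil(x)

def fractional_part(x: float) -> float:
    """g(x) = x - f(x); it has the sign of x and magnitude strictly less than 1."""
    return x - integer_part(x)

def rate_loss_bits(gains_into_receiver) -> float:
    """delta_i = (1/2) log2(1 + 2 * sum_j h_ji^2) for the gains h_ji into receiver i."""
    return 0.5 * math.log2(1.0 + 2.0 * sum(h * h for h in gains_into_receiver))

if __name__ == "__main__":
    for x in (-2.7, -0.3, 0.0, 0.9, 3.2):
        print(x, integer_part(x), fractional_part(x))
    print("rate loss for unit gains, K = 3:", rate_loss_bits([1, 1, 1]))
```

Truncating toward zero never increases the magnitude of a symbol, so the integer-valued part inherits the power constraint of the original signal, while the bounded fractional part costs at most a constant number of bits.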

Figure 1: A decomposition of a K-user Gaussian IFC.

If $R = (R_1, \ldots, R_K) \in \mathcal{C}(H, \mathbf{1}, P)$, then for any $\eta > 0$ there exists a family of codebooks
$\{\mathcal{C}_{1,n}, \ldots, \mathcal{C}_{K,n}\}_n$ satisfying the average power constraints, and decoding functions $\{g_{1,n}, \ldots, g_{K,n}\}_n$
with average decoding error probability going to 0 as $n \to \infty$, such that $\lim_{n \to \infty} \frac{1}{n}\log_2 |\mathcal{C}_{i,n}| \ge
R_i - \eta$. For block-length n we have:

\[ \begin{aligned}
n(R_i - \eta - \epsilon_n)
&\le I(x_i^n; y_i^n) && (11) \\
&\le I(x_{i1}^n, x_{i2}^n; y_{i1}^n, y_{i2}^n) && (12) \\
&= h(y_{i1}^n, y_{i2}^n) - h(y_{i1}^n, y_{i2}^n \mid x_{i1}^n, x_{i2}^n) \\
&= h(y_{i1}^n) + h(y_{i2}^n \mid y_{i1}^n) - h\Big( \sum_{j \ne i} h_{ji} x_{j1}^n + z_{i1}^n, \; \sum_{j \ne i} h_{ji} x_{j2}^n + z_{i2}^n \Big) \\
&\le h(y_{i1}^n) + h(y_{i2}^n) - h\Big( \sum_{j \ne i} h_{ji} x_{j1}^n + z_{i1}^n \Big) - h\Big( \sum_{j \ne i} h_{ji} x_{j2}^n + z_{i2}^n \,\Big|\, \sum_{j \ne i} h_{ji} x_{j1}^n + z_{i1}^n \Big) && (13) \\
&\le h(y_{i1}^n) + h(y_{i2}^n) - h\Big( \sum_{j \ne i} h_{ji} x_{j1}^n + z_{i1}^n \Big) \\
&\quad - h\Big( \sum_{j \ne i} h_{ji} x_{j2}^n + z_{i2}^n \,\Big|\, \sum_{j \ne i} h_{ji} x_{j1}^n + z_{i1}^n, \, x_{12}^n, \ldots, x_{K2}^n \Big) && (14) \\
&= h(y_{i1}^n) + h\Big( \sum_{j=1}^{K} h_{ji} x_{j2}^n + z_{i2}^n \Big) - h\Big( \sum_{j \ne i} h_{ji} x_{j1}^n + z_{i1}^n \Big) - h(z_{i2}^n) \\
&\le I(x_{i1}^n; y_{i1}^n) + \frac{n}{2}\log_2\Big( 2\pi e \Big( \frac{1}{2} + \sum_{j=1}^{K} h_{ji}^2 \Big) \Big) - \frac{n}{2}\log_2\Big( 2\pi e \cdot \frac{1}{2} \Big) && (15) \\
&\le I(x_{i1}^n; \bar{y}_{i1}^n) + \frac{n}{2}\log_2\Big( 1 + 2\sum_{j=1}^{K} h_{ji}^2 \Big), && (16)
\end{aligned} \]

where $\epsilon_n \to 0$ as $n \to \infty$. We used Fano's inequality in (11), the data processing inequality in (12),
the fact that conditioning reduces entropy in (13) and (14), the Gaussian bound for differential
entropies in (15), noting that $|x_{j2}(t)| < 1$, $t = 1, \ldots, n$, and used the data processing inequality in
(16), where we defined $\bar{y}_{i1}^n = \sum_{j=1}^{K} h_{ji} x_{j1}^n$.

Since $\|x_{i1}^n\|^2 \le \|x_i^n\|^2 \le nP_i$ when the channel is used with codebooks satisfying the power
constraints, it follows that the codebooks $\{\mathcal{C}_{1,n}, \ldots, \mathcal{C}_{K,n}\}_n$ induce distributions on $x_{i1}^n$, $i = 1, \ldots, K$,
that satisfy the power constraints $\{P_i\}_{i=1}^{K}$. These distributions can be used in (10) to conclude
that $(R_1 - \eta - \delta_1, \ldots, R_K - \eta - \delta_K)$ is an achievable rate vector in $\mathcal{C}_D(H, P)$. Since $\eta > 0$ is
arbitrary, the result follows. ■

6 Here we are considering more general IFCs than the Gaussian case. Definition 1 still applies, but with the
appropriate conditional probability distribution of channel outputs given channel inputs.



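Lemma 5 Let $\{(\mathcal{C}_{i,n}, g_{i,n}, P_{e,i,n})\}_{i=1}^K$ denote a block length $n$ coding scheme for a K-user interference
channel satisfying average (per codeword) power constraints $\{P_i\}_{i=1}^K$ with rates $\{R_i\}_{i=1}^K$ and
average error probabilities $\{P_{e,i,n}\}_{i=1}^K$. If $\{\tilde{\mathcal{C}}_{1,n}, \ldots, \tilde{\mathcal{C}}_{K,n}\}$ is any set of codebooks with $\tilde{\mathcal{C}}_{i,n} \subseteq \mathcal{C}_{i,n}$,
$|\tilde{\mathcal{C}}_{i,n}| \ge |\mathcal{C}_{i,n}|/\alpha_i$ and $\alpha_i \ge 1$, for all $i = 1, \ldots, K$, then $\{(\tilde{\mathcal{C}}_{i,n}, g_{i,n}, \tilde{P}_{e,i,n})\}_{i=1}^K$ is a coding scheme
with rates no smaller than $\{R_i - \frac{1}{n}\log_2 \alpha_i\}_{i=1}^K$ and average error probabilities $\{\tilde{P}_{e,i,n}\}_{i=1}^K$ satisfying
$\tilde{P}_{e,i,n} \le (\prod_{j=1}^K \alpha_j) P_{e,i,n}$, and also satisfying the power constraints $\{P_i\}_{i=1}^K$.

Proof: Since $\mathcal{C}_{i,n}$ has rate $R_i$, we have that for $i = 1, \ldots, K$ the rate $\tilde{R}_i$ of $\tilde{\mathcal{C}}_{i,n}$ satisfies:

$$\tilde{R}_i = \frac{1}{n}\log_2 |\tilde{\mathcal{C}}_{i,n}| \ge \frac{1}{n}\log_2\left(\frac{|\mathcal{C}_{i,n}|}{\alpha_i}\right) = R_i - \frac{1}{n}\log_2 \alpha_i. \tag{17}$$

During communication with the codebooks $\{\mathcal{C}_{1,n}, \ldots, \mathcal{C}_{K,n}\}$ the transmitted messages (and hence
the codewords) are chosen uniformly and independently, and as a result we have:

$$\begin{aligned}
P_{e,i,n} &= \frac{1}{\prod_{j=1}^K |\mathcal{C}_{j,n}|} \sum_{c_1 \in \mathcal{C}_{1,n}} \cdots \sum_{c_K \in \mathcal{C}_{K,n}} P_{e,i,n}(c_1, \ldots, c_K) \\
&\ge \frac{1}{\prod_{j=1}^K |\mathcal{C}_{j,n}|} \sum_{c_1 \in \tilde{\mathcal{C}}_{1,n}} \cdots \sum_{c_K \in \tilde{\mathcal{C}}_{K,n}} P_{e,i,n}(c_1, \ldots, c_K) \\
&\ge \frac{1}{\prod_{j=1}^K \alpha_j} \cdot \frac{1}{\prod_{j=1}^K |\tilde{\mathcal{C}}_{j,n}|} \sum_{c_1 \in \tilde{\mathcal{C}}_{1,n}} \cdots \sum_{c_K \in \tilde{\mathcal{C}}_{K,n}} P_{e,i,n}(c_1, \ldots, c_K) \\
&= \frac{1}{\prod_{j=1}^K \alpha_j} \tilde{P}_{e,i,n}
\end{aligned} \tag{18}$$

where we denoted by $P_{e,i,n}(c_1, \ldots, c_K)$ the probability of decoding error when the codewords
$c_1, \ldots, c_K$ are transmitted.

Finally, since every codeword of $\mathcal{C}_{i,n}$ satisfies the power constraint $P_i$, the codewords of $\tilde{\mathcal{C}}_{i,n}$ satisfy
the power constraint $P_i$. ■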
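The error-probability part of Lemma 5 can be checked numerically on a toy model. The sketch below is only an illustration under stated assumptions: codewords are represented by integer indices, the per-tuple error probabilities are random numbers rather than those of a real decoder, and the names `average_error`, `full`, `sub` and `bound` are hypothetical.

```python
import itertools
import random


def average_error(codebooks, err):
    """Average of err[(c_1, ..., c_K)] over uniformly chosen codeword tuples."""
    tuples = list(itertools.product(*codebooks))
    return sum(err[t] for t in tuples) / len(tuples)


random.seed(0)
K = 3
# Toy codebooks: each codeword is just an index (assumption for illustration).
full = [list(range(6)) for _ in range(K)]
# Arbitrary per-tuple error probabilities, standing in for P_e(c_1, ..., c_K).
err = {t: random.random() * 0.1 for t in itertools.product(*full)}

# Keep an arbitrary half of each codebook, so alpha_i = 2 for every user.
sub = [random.sample(cb, len(cb) // 2) for cb in full]
alphas = [len(cb) / len(s) for cb, s in zip(full, sub)]

p_full = average_error(full, err)
p_sub = average_error(sub, err)

# Right-hand side of the lemma: (prod_j alpha_j) * P_e.
bound = p_full
for a in alphas:
    bound *= a
```

Whatever sub-codebooks are sampled, `p_sub` cannot exceed `bound`, which is the chain of inequalities in (18) read from bottom to top.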



Lemma 6 Any achievable rate vector in a K-user IFC can be achieved by codebooks with no repeated
codewords, i.e. for every $n = 1, 2, \ldots$ and $k = 1, \ldots, K$, the codebook $\mathcal{C}_{k,n}$ is such that

$$c_i, c_j \in \mathcal{C}_{k,n},\ i \ne j \ \Rightarrow\ c_i \ne c_j.$$

Proof: Since $(R_1, \ldots, R_K)$ is achievable, for any $\eta > 0$ there exists a family of codebooks
$\{\mathcal{C}_{1,n}, \ldots, \mathcal{C}_{K,n}\}_n$ with $\frac{1}{n}\log_2 |\mathcal{C}_{i,n}| \ge R_i - \eta$ satisfying the average power constraints, and a family
of decoding functions achieving average error probabilities $P_{e,i,n}$ going to 0 as $n \to \infty$ for every
$i = 1, \ldots, K$.

Consider the single user channel between transmitter $i$ and receiver $i$ obtained from the interference
channel by removing all the interfering signals at receiver $i$. Since the interference cannot help
receiver $i$ decode the message of its own transmitter, it follows that $R_i$ can be achieved in the
single user channel with the family of codebooks $\{\mathcal{C}_{i,n}\}_{i,n}$, for some decoding functions $\{g'_{i,n}\}_{i,n}$
with average error probabilities no larger than $P_{e,i,n}$, $i = 1, \ldots, K$, $n = 1, 2, \ldots$. Let $\tilde{\mathcal{C}}_{i,n} \subseteq \mathcal{C}_{i,n}$
be obtained by removing the worst (i.e. leading to the largest error probability in the single user
channel) half of the codewords in $\mathcal{C}_{i,n}$. It is easy to see that $\tilde{\mathcal{C}}_{i,n}$ and $g'_{i,n}$ achieve a maximal error
probability in the single user channel no larger than $2P_{e,i,n}$, and in particular, $\tilde{\mathcal{C}}_{i,n}$ has no repeated
codewords for $n$ large enough. The result follows by using Lemma 5 with $\{\tilde{\mathcal{C}}_{i,n}\}_{i,n}$ as defined here,
and $\alpha_i = 2$, $i = 1, \ldots, K$, noting that as $n \to \infty$, $\frac{1}{n}\log_2 \alpha_i \to 0$ and $(\prod_{j=1}^K \alpha_j) P_{e,i,n} \to 0$ for
$i = 1, \ldots, K$. ■
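The expurgation step used in this proof (discard the worst half of the codewords) can be illustrated with a short sketch. The per-codeword error values below are synthetic and the function name `expurgate` is hypothetical; the point is only that the largest error among the kept codewords is at most twice the original average.

```python
import random


def expurgate(per_codeword_error):
    """Return the indices of the better half of the codewords (smallest error first)."""
    order = sorted(range(len(per_codeword_error)), key=lambda i: per_codeword_error[i])
    return order[: len(order) // 2]


random.seed(1)
# Synthetic per-codeword error probabilities (assumption for illustration).
errors = [random.random() ** 4 for _ in range(64)]
avg = sum(errors) / len(errors)

kept = expurgate(errors)
max_kept = max(errors[i] for i in kept)
```

If `max_kept` exceeded `2 * avg`, every discarded codeword would have an error above `2 * avg` as well, and the discarded half alone would push the average above `avg`, which is impossible.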



4.2 Supporting results from additive combinatorics 

Given an abelian group $G$ and two sets $A, B \subseteq G$ let $A + B$ denote the set of sums obtainable by
adding one element from $A$ to one element from $B$. Formally, $A + B = \{a + b : a \in A, b \in B\}$.
The set of differences $A - B$ can be defined analogously. For any integer $p$ and set $A \subseteq G$, we
denote by $p \cdot A$ the set consisting of all $p$-multiples of elements of $A$, or $p \cdot A = \{pa : a \in A\}$.
For a non-negative integer $p$ and $A \subseteq G$ we denote by $p * A$ the set of $p$-fold sums of $A$, or
$p * A = \{a_1 + a_2 + \ldots + a_p : a_i \in A \text{ for } i = 1, \ldots, p\}$. We shall also require the concept of a partial
sum set. Given $A, B \subseteq G$, let $F \subseteq A \times B$. The partial sum set of $A$ and $B$ with respect to $F$,
denoted as $A \overset{F}{+} B$, is defined as $A \overset{F}{+} B = \{a + b : (a, b) \in F\}$. If $F = A \times B$, then $A \overset{F}{+} B = A + B$.
We shall later need the following result on sum sets.
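As a concrete illustration of these definitions, the following sketch implements the set operations for finite sets of integers. The function names are hypothetical, and the convention that the 0-fold sum set is {0} is an assumption made here for the empty sum.

```python
def sumset(A, B):
    """A + B = {a + b : a in A, b in B}."""
    return {a + b for a in A for b in B}


def diffset(A, B):
    """A - B = {a - b : a in A, b in B}."""
    return {a - b for a in A for b in B}


def dilate(p, A):
    """p . A = {p * a : a in A}."""
    return {p * a for a in A}


def fold_sum(p, A):
    """p * A: all sums of p elements of A (assumed to be {0} when p = 0)."""
    result = {0}
    for _ in range(p):
        result = sumset(result, A)
    return result


def partial_sumset(F):
    """Partial sum set {a + b : (a, b) in F} for a set of pairs F."""
    return {a + b for (a, b) in F}


A = {0, 1, 3}
B = {0, 2}
```

Note that `dilate(2, A)` is in general a strict subset of `fold_sum(2, A)`, which is the inclusion used later when dilates are replaced by iterated sum sets.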

Lemma 7 Let $p, q \in \mathbb{Z}$ and $K \in \mathbb{R}$ with $K \ge 1$. If $|A + B| \le K|A|^{1/2}|B|^{1/2}$ then
$|p \cdot A + q \cdot B| \le K^{d(p,q)}|A|^{1/2}|B|^{1/2}$ for $d(p, q) = 2\max\{|p|, |q|\} + 5$.

Our proof, stated below, follows fairly standard arguments from additive combinatorics (see Chapters
2 and 6 of [20]) and is based on the following key results concerning non-empty subsets
$A, B \subseteq G$. Proofs of these results can be found in [20].

Lemma 8 (Ruzsa's covering lemma) There exists a subset $X \subseteq B$ such that $|X| \le |A + B|/|A|$
and $B \subseteq A - A + X$.

7 We shall apply these results to the group of vectors of integers with component-wise addition.

Lemma 9 (Plünnecke-Ruzsa inequality) For positive integers $p, q$ and any real valued $K \ge 1$,
if $|A + B| \le K|A|$ then $|p * B - q * B| \le K^{p+q}|A|$.
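The covering lemma has a constructive flavor that is easy to sketch for finite integer sets. The code below follows the usual greedy argument (choose a maximal X in B whose translates A + x are pairwise disjoint); it is offered as an illustration of that standard argument rather than as the proof given in [20], and the function name `ruzsa_cover` is hypothetical.

```python
def ruzsa_cover(A, B):
    """Greedily pick X in B so that the translates A + x, x in X, are pairwise disjoint."""
    X, covered = [], set()
    for b in sorted(B):
        translate = {a + b for a in A}
        if covered.isdisjoint(translate):
            X.append(b)
            covered |= translate
    return set(X)


A = {0, 1, 2, 5}
B = {0, 3, 4, 9, 10, 11}
X = ruzsa_cover(A, B)

A_plus_B = {a + b for a in A for b in B}
A_minus_A_plus_X = {a1 - a2 + x for a1 in A for a2 in A for x in X}
```

The disjoint translates all sit inside A + B, which gives the cardinality bound on X, and any element of B that was skipped must collide with some chosen translate, which gives the covering property.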

Proof: (of Lemma 7) By Lemma 8 there exists a set $X \subseteq q \cdot B$ with $|X| \le |A + q \cdot B|/|A|$
satisfying $q \cdot B \subseteq A - A + X$. This, in turn, implies that $p \cdot A + q \cdot B \subseteq p \cdot A + A - A + X$. Therefore,

$$\begin{aligned}
|p \cdot A + q \cdot B| &\le |p \cdot A + A - A||X| \\
&\le |p \cdot A + A - A| \frac{|A + q \cdot B|}{|A|},
\end{aligned} \tag{19}$$

where we have used the trivial inequality $|S + T| \le |S||T|$ in the first step. Again, by Lemma 8
there exists a set $Y \subseteq A$ with $|Y| \le |A + B|/|B|$ satisfying $A \subseteq B - B + Y$, which implies that
$A + q \cdot B \subseteq q \cdot B + B - B + Y$. Proceeding as above, we then have

$$\begin{aligned}
|A + q \cdot B| &\le |q \cdot B + B - B||Y| \\
&\le |q \cdot B + B - B| \frac{|A + B|}{|B|}.
\end{aligned} \tag{20}$$

Combining (19) and (20) gives

$$|p \cdot A + q \cdot B| \le |p \cdot A + A - A||q \cdot B + B - B| \frac{|A + B|}{|A||B|}$$
$$\le K|p \cdot A + A - A||q \cdot B + B - B||A|^{-1/2}|B|^{-1/2} \tag{21}$$
$$\le K|p * A + A - A||q * B + B - B||A|^{-1/2}|B|^{-1/2} \tag{22}$$

where (21) follows from the assumption of the lemma and (22) follows from the trivial inclusions
$p \cdot A + A - A \subseteq p * A + A - A$ and $q \cdot B + B - B \subseteq q * B + B - B$. Rewriting the assumption of
the lemma as $|A + B| \le K|A|^{-1/2}|B|^{1/2}|A|$, we can apply Lemma 9 with $K|A|^{-1/2}|B|^{1/2}$ in place of $K$ to
conclude that

$$|q * B + B - B| \le K^{|q|+2}|A|^{-(|q|+2)/2}|B|^{(|q|+2)/2}|A|. \tag{23}$$

Similarly, rewriting the assumption of the lemma as $|A + B| \le K|A|^{1/2}|B|^{-1/2}|B|$, we can apply
Lemma 9 with the roles of $A$ and $B$ switched, to obtain

$$|p * A + A - A| \le K^{|p|+2}|A|^{(|p|+2)/2}|B|^{-(|p|+2)/2}|B|. \tag{24}$$

Combining (22) with (23) and (24) gives

$$\begin{aligned}
|p \cdot A + q \cdot B| &\le K^{|p|+2}|A|^{(|p|+2)/2}|B|^{-(|p|+2)/2} K^{|q|+2}|A|^{-(|q|+2)/2}|B|^{(|q|+2)/2} K|A|^{1/2}|B|^{1/2} \\
&\le K^{2\max\{|p|,|q|\}+5}|A|^{1/2}|B|^{1/2},
\end{aligned} \tag{25}$$

where (25) follows from $K|A|^{-1/2}|B|^{1/2} \ge 1$ and $K|A|^{1/2}|B|^{-1/2} \ge 1$, which in turn follow from
the lemma's assumption $|A + B| \le K|A|^{1/2}|B|^{1/2} = K|A|^{-1/2}|B|^{1/2}|A| = K|A|^{1/2}|B|^{-1/2}|B|$ together with the
obvious relations $|A + B| \ge |A|$ and $|A + B| \ge |B|$. The lemma is thus established with $d(p, q) =
2\max\{|p|, |q|\} + 5$. ■
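A small numerical sanity check of the statement of Lemma 7 can be written as follows. This is a sketch on two arbitrarily chosen arithmetic progressions of integers; the function names are hypothetical, and K is taken as the smallest value for which the hypothesis of the lemma holds.

```python
import math


def dilate_sum_size(p, A, q, B):
    """Cardinality of p . A + q . B."""
    return len({p * a + q * b for a in A for b in B})


def lemma7_bound(p, q, A, B):
    """K^{d(p,q)} |A|^{1/2} |B|^{1/2} with K = |A + B| / (|A||B|)^{1/2}."""
    root = math.sqrt(len(A) * len(B))
    K = len({a + b for a in A for b in B}) / root
    d = 2 * max(abs(p), abs(q)) + 5
    return K ** d * root


A = set(range(0, 20, 2))
B = set(range(0, 30, 3))
checks = {(p, q): (dilate_sum_size(p, A, q, B), lemma7_bound(p, q, A, B))
          for p in (1, 2, -3) for q in (1, -2, 5)}
```

For sets this small the bound is very loose; the lemma matters when K stays bounded while the sets grow, as in the proof of Lemma 11.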
Remark 1 A considerably smaller $d(p, q)$ for larger $p$ and $q$ can be obtained by applying the bounds
of [21] to the factors $|p \cdot A + A - A|$ and $|q \cdot B + B - B|$ appearing in (21). These bounds are obtained
through a more sophisticated application of Lemma 9 that takes greater advantage of the structure
of sets like $p_1 \cdot A + \ldots + p_m \cdot A$. The resulting $d(p, q)$ grows logarithmically in $|p|$ and $|q|$. It is likely
that a direct application of the technique of [21] to $p \cdot A + q \cdot B$ would even further improve $d(p, q)$
for large $p$ and $q$.

We shall also make use of the following lemma relating the entropy of a sum of uniformly distributed 
independent random variables to a partial sum set involving their supports. 

Lemma 10 Let $X$ and $Y$ be independent uniform random variables with support sets $A \subseteq G$ and
$B \subseteq G$ for some abelian group $G$, with $|A| \ge |B|$, such that

$$H(X + Y) \le (1 + \epsilon)\log_2 |A|$$

for some $\epsilon > 0$. Then, for any given $c > 1$ there exists a set $F \subseteq A \times B$ such that

$$|F| \ge |A||B|\frac{c - 1}{c} \quad \text{and} \quad |A \overset{F}{+} B| \le \left(|A|^{1/2 + c\epsilon}|B|^{-1/2}\right)|A|^{1/2}|B|^{1/2}.$$

Proof: Define $T(s) = \{(a, b) \in A \times B : a + b = s\}$. Define $S = \{s : |T(s)| \ge |B||A|^{-c\epsilon}\}$ and
$F = \{(a, b) \in A \times B : a + b \in S\}$. From these definitions we have $|A \overset{F}{+} B| = |S|$. In addition,

$$|A||B| \ge \sum_{s \in S} |T(s)| \ge |S||B||A|^{-c\epsilon},$$

where the last step follows from the definition of $S$. As a result, $|S| \le |A|^{1 + c\epsilon}$, giving the required
upper bound for $|A \overset{F}{+} B| = |S|$.

To get a lower bound on $|F|$ we start by rewriting it as follows:

$$\begin{aligned}
|F| &= \sum_{(x,y) \in A \times B} 1((x, y) \in F) \\
&= |A||B| \Pr((X, Y) \in F) \\
&= |A||B| \Pr(X + Y \in S) \\
&= |A||B| [1 - \Pr(X + Y \in S^c)] \\
&= |A||B| [1 - \Pr(|T(X + Y)| < |B||A|^{-c\epsilon})].
\end{aligned} \tag{26}$$

The probability term can be upper bounded using Markov's inequality, noting that $|B| \ge |T(s)|$
for all $s$:

$$\begin{aligned}
\Pr(|T(X + Y)| < |B||A|^{-c\epsilon}) &= \Pr\left(\log_2 \frac{|B|}{|T(X + Y)|} > c\epsilon \log_2 |A|\right) \\
&\le \frac{E\left[\log_2 \frac{|B|}{|T(X + Y)|}\right]}{c\epsilon \log_2 |A|}.
\end{aligned} \tag{27}$$

To bound the expectation in (27) we note that for any $(x, y) \in A \times B$, $\Pr(X = x, Y = y) = \frac{1}{|A||B|}$,
and expand $I(X + Y; Y)$ in two different ways

$$I(X + Y; Y) = H(X + Y) - H(X + Y | Y) = H(X + Y) - H(X) = H(X + Y) - \log_2 |A| \le \epsilon \log_2 |A|$$

$$\begin{aligned}
I(X + Y; Y) &= H(Y) - H(Y | X + Y) = \log_2 |B| - E[-\log_2 P_{Y|X+Y}(Y | X + Y)] \\
&= \log_2 |B| - E\left[\log_2 \frac{P_{X+Y}(X + Y)}{P_{Y,X+Y}(Y, X + Y)}\right] \\
&= \log_2 |B| - E\left[\log_2 \left(\frac{|T(X + Y)|}{|A||B|} \cdot |A||B|\right)\right] \\
&= E\left[\log_2 \frac{|B|}{|T(X + Y)|}\right]
\end{aligned}$$

to obtain

$$E\left[\log_2 \frac{|B|}{|T(X + Y)|}\right] \le \epsilon \log_2 |A|. \tag{28}$$

From (26), (27) and (28) we obtain:

$$|F| \ge |A||B|\left(1 - \frac{1}{c}\right). \quad ■$$
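The construction of S and F in this proof is explicit enough to run on a small example. In the sketch below the sets A and B are arbitrary integer sets, epsilon is computed as the smallest value for which the entropy hypothesis of Lemma 10 holds, and the function name `lemma10_sets` is hypothetical.

```python
import math
from collections import Counter


def lemma10_sets(A, B, c):
    """Build S and F as in the proof of Lemma 10 for independent uniform X on A, Y on B."""
    counts = Counter(a + b for a in A for b in B)      # |T(s)| for each sum s
    total = len(A) * len(B)
    entropy = -sum(n / total * math.log2(n / total) for n in counts.values())
    eps = entropy / math.log2(len(A)) - 1               # H(X+Y) = (1 + eps) log2 |A|
    threshold = len(B) * len(A) ** (-c * eps)
    S = {s for s, n in counts.items() if n >= threshold}
    F = {(a, b) for a in A for b in B if a + b in S}
    return eps, S, F


A, B, c = set(range(8)), {0, 1, 2}, 2.0
eps, S, F = lemma10_sets(A, B, c)
```

In this example only the sums reachable in three ways survive the threshold, so S has 6 elements and F keeps 18 of the 24 pairs, comfortably above the guaranteed fraction 1 - 1/c.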



Finally, we shall also rely on the following important theorem from additive combinatorics, as stated 
in [20], relating partial sum sets to full sum sets. 



Theorem 3 (Balog-Szemerédi-Gowers theorem) Let $A \subseteq G$ and $B \subseteq G$ for some abelian
group $G$ and let $F \subseteq A \times B$ be such that

$$|F| \ge |A||B|/K \quad \text{and} \quad |A \overset{F}{+} B| \le K'|A|^{1/2}|B|^{1/2}$$

for some $K \ge 1$ and $K' > 0$. Then there exist $A' \subseteq A$, $B' \subseteq B$ such that

$$|A'| \ge \frac{|A|}{4\sqrt{2}K}, \qquad |B'| \ge \frac{|B|}{4K}, \qquad |A' + B'| \le 2^{12} K^5 (K')^3 |A|^{1/2}|B|^{1/2}.$$

Theorem 3 is proved in Chapter 6 of [20].
Figure 2: A three-user Gaussian IFC with channel matrix H.



4.3 Main lemma 



In this subsection, we state and prove the main lemma at the core of our proof of Theorem 2.



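Lemma 11 Let $p, q \in \mathbb{Z}$, $p, q \ne 0$, and

$$H = [h_{ij}] = \begin{pmatrix} 1 & 0 & 0 \\ 1 & p & 0 \\ 1 & q & 1 \end{pmatrix}$$

with the corresponding GIFC depicted in Figure 2. Then $\mathrm{DoF}(H) \le \frac{3}{2} - \epsilon(p, q)$, with $\epsilon(p, q) > 0$.
In particular, this holds for

$$\epsilon(p, q) = \frac{1}{12\,d(p, q) + 2},$$

where $d(p, q)$ is as in Lemma 7.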



Proof: We start by extending the definition of degrees-of-freedom to deterministic interference
channels. Consider a K-user deterministic interference channel with input and output alphabets
$\{\mathcal{X}_i\}_{i=1}^K$, $\{\mathcal{Y}_i\}_{i=1}^K$, defined by

$$y_i(t) = V_i(x_1(t), \ldots, x_K(t)), \quad i = 1, \ldots, K;\ t = 1, 2, \ldots,$$

where for each $i$, $x_i(t) \in \mathcal{X}_i$ must satisfy an average power constraint $\sum_{t=1}^n x_i^2(t) \le nP$, and
$V_i : \mathcal{X}_1 \times \cdots \times \mathcal{X}_K \to \mathcal{Y}_i$ is a deterministic function. Let $\mathcal{C}(P)$ be the capacity region of the channel
with power constraint $P$. We define the degrees-of-freedom of the deterministic channel by

$$\mathrm{DoF} \triangleq \limsup_{P \to \infty} \frac{\max_{R \in \mathcal{C}(P)} \mathbf{1}^t R}{(1/2)\log_2 P}.$$

Due to Lemma 4 the degrees-of-freedom of the GIFC with channel matrix $H$ is upper bounded by
the degrees-of-freedom of the deterministic channel

$$y_i(t) = \sum_{j=1}^{3} h_{ji} x_j(t), \quad i = 1, 2, 3, \tag{29}$$

where $x_i(t), y_i(t) \in \mathbb{Z}$, and where the channel inputs are subject to the average power constraint
$\frac{1}{n}\sum_{t=1}^n x_i^2(t) \le P$, for $i = 1, 2, 3$. We will prove by contradiction that the degrees-of-freedom of
this deterministic channel is strictly smaller than 3/2. Therefore, to get a contradiction, we assume
that as $P$ goes to infinity, there are achievable rates $(R_1(P), R_2(P), R_3(P))$ satisfying

$$\limsup_{P \to \infty} \frac{R_1(P) + R_2(P) + R_3(P)}{(1/2)\log_2 P} = \frac{3}{2}.$$

This implies that there exists an increasing sequence of power constraints $\{P_m\}_{m=1}^{\infty}$ with
$\lim_{m \to \infty} P_m = \infty$, such that

$$\lim_{m \to \infty} \frac{R_1(P_m) + R_2(P_m) + R_3(P_m)}{(1/2)\log_2 P_m} = \frac{3}{2}. \tag{30}$$

For a power constraint $P_m$, consider a set of codebooks $\{\mathcal{X}_{1,n,m}, \mathcal{X}_{2,n,m}, \mathcal{X}_{3,n,m}\}$ of block-length
$n$ with rates $\{R_{1,m}, R_{2,m}, R_{3,m}\}$ in the deterministic channel (29), and with decoding functions
$\{g_{1,n,m}, g_{2,n,m}, g_{3,n,m}\}$ that achieve average error probabilities $\{P_{e,1,n,m}, P_{e,2,n,m}, P_{e,3,n,m}\}$. We assume
that $\{R_{1,m}, R_{2,m}, R_{3,m}\}$ satisfy (30) as $m \to \infty$ and that $\{P_{e,1,n,m}, P_{e,2,n,m}, P_{e,3,n,m}\}$ go to 0 as
$n \to \infty$ for fixed $m$. In addition, let $x_\ell^n$, $\ell = 1, 2, 3$, be the independent random vectors induced by
the codebooks resulting from the uniform distribution of the messages. It follows that $\mathcal{X}_{\ell,n,m} \subseteq \mathbb{Z}^n$
is the support set of $x_\ell^n$, $\ell = 1, 2, 3$. Due to Lemma 6 we can assume without loss of generality that
the codebooks do not have repeated codewords. This implies that each $x_\ell^n$ is chosen uniformly in
$\mathcal{X}_{\ell,n,m}$, or equivalently that $\Pr(x_\ell^n) = \frac{1}{|\mathcal{X}_{\ell,n,m}|}$, for $x_\ell^n \in \mathcal{X}_{\ell,n,m}$, $\ell = 1, 2, 3$.

Using Fano's inequality we write (where, for simplicity, we suppress the dependence on $m$ of the
variables $x_1$, $x_2$, etc.):

$$\begin{aligned}
n(R_{1,m} + R_{2,m} + R_{3,m} - \delta_n) &\le \sum_{i=1}^{3} I(x_i^n; y_i^n) \\
&= \sum_{i=1}^{3} [H(y_i^n) - H(y_i^n | x_i^n)] \\
&= H(x_1^n + x_2^n + x_3^n) - H(x_2^n + x_3^n) + H(p x_2^n + q x_3^n) - H(q x_3^n) + H(x_3^n) \\
&= H(x_1^n + x_2^n + x_3^n) - H(x_2^n + x_3^n) + H(p x_2^n + q x_3^n).
\end{aligned} \tag{31}$$

To handle the first term of (31), we use the following lemma, which follows from exercise 8.7 of [18]
and Jensen's inequality.

Lemma 12 Let $X^n = (X_1, \ldots, X_n)$ be a discrete random vector on $\mathbb{Z}^n$. Then,

$$H(X^n) \le \frac{n}{2}\log_2\left[2\pi e\left(\frac{1}{n}\sum_{i=1}^{n} \mathrm{Var}(X_i) + \frac{1}{12}\right)\right]$$

where $\mathrm{Var}(X_i) = E(X_i^2) - E^2(X_i)$ is the variance of $X_i$.

Therefore, using the independence among the input signals and the fact that
$\sum_{t=1}^n \mathrm{Var}(x_{i,t}) \le E\|x_i^n\|^2 \le nP_m$, we obtain

$$H(x_1^n + x_2^n + x_3^n) \le \frac{n}{2}\log_2\left[2\pi e\left(3P_m + \frac{1}{12}\right)\right]. \tag{32}$$
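The scalar case (n = 1) of Lemma 12 is easy to check numerically for integer-valued distributions. The sketch below uses two arbitrary example distributions; the helper names are hypothetical.

```python
import math


def entropy_bits(pmf):
    """Discrete entropy in bits of a pmf given as {value: probability}."""
    return -sum(p * math.log2(p) for p in pmf.values() if p > 0)


def variance(pmf):
    """Variance of the random variable described by pmf."""
    mean = sum(x * p for x, p in pmf.items())
    return sum((x - mean) ** 2 * p for x, p in pmf.items())


def lemma12_bound(pmf):
    """Right-hand side of Lemma 12 for n = 1: (1/2) log2(2 pi e (Var + 1/12))."""
    return 0.5 * math.log2(2 * math.pi * math.e * (variance(pmf) + 1 / 12))


uniform = {x: 1 / 8 for x in range(8)}
skewed = {0: 0.7, 1: 0.2, 5: 0.1}
```

For the uniform distribution on M consecutive integers the gap between the two sides is the constant (1/2) log2(2 pi e / 12), about 0.25 bits, independent of M.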
The remaining two terms in (31) will be bounded in two different ways. First we use the following
simple bounds on $H(aX + bY)$, valid for any $a, b \ne 0$ and independent random variables $X$ and $Y$:

$$\begin{aligned}
H(aX + bY) &\le H(aX + bY, aX) = H(aX) + H(aX + bY | aX) = H(X) + H(Y) \\
H(aX + bY) &\ge H(aX + bY | bY) = H(aX) = H(X).
\end{aligned} \tag{33}$$

Using these bounds we get

$$\begin{aligned}
H(p x_2^n + q x_3^n) - H(x_2^n + x_3^n) &\le H(x_2^n) + H(x_3^n) - \max\{H(x_2^n); H(x_3^n)\} \\
&= \log_2 \min\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}.
\end{aligned} \tag{34}$$

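We define $f_{\min}(n, m)$ such that $\log_2 \min\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\} = (\frac{1}{4} + f_{\min}(n, m))\, n \log_2 P_m$. Note that
by the assumed dependence of $\mathcal{X}_{i,n,m}$ on $n$, $\lim_{n \to \infty} f_{\min}(n, m)$ exists for each $m$. Then, using
(30), (31), (32), and (34), we have

$$\begin{aligned}
\frac{3}{2} &= \lim_{m \to \infty} \frac{\lim_{n \to \infty}(R_{1,m} + R_{2,m} + R_{3,m} - \delta_n)}{(1/2)\log_2 P_m} \\
&\le 1 + \frac{1}{2} + \liminf_{m \to \infty} \frac{\lim_{n \to \infty} f_{\min}(n, m)\log_2 P_m}{(1/2)\log_2 P_m}
\end{aligned}$$

which implies

$$\liminf_{m \to \infty} \lim_{n \to \infty} f_{\min}(n, m) \ge 0. \tag{35}$$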

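As a second option, we use Lemma 12 to get the upper bound

$$H(p x_2^n + q x_3^n) \le \frac{n}{2}\log_2\left[2\pi e\left((p^2 + q^2)P_m + \frac{1}{12}\right)\right] \tag{36}$$

which we use to get

$$H(p x_2^n + q x_3^n) - H(x_2^n + x_3^n) \le \frac{n}{2}\log_2\left[2\pi e\left((p^2 + q^2)P_m + \frac{1}{12}\right)\right] - \log_2 \max\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}. \tag{37}$$

We define $f_{\max}(n, m)$ such that $\log_2 \max\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\} = (\frac{1}{4} + f_{\max}(n, m))\, n \log_2 P_m$. Note, as
above, that $\lim_{n \to \infty} f_{\max}(n, m)$ exists for each $m$. Then, using (30), (31), (32), and (37), we have

$$\begin{aligned}
\frac{3}{2} &= \lim_{m \to \infty} \frac{\lim_{n \to \infty}(R_{1,m} + R_{2,m} + R_{3,m} - \delta_n)}{(1/2)\log_2 P_m} \\
&\le 1 + 1 - \frac{1}{2} - \limsup_{m \to \infty} \frac{\lim_{n \to \infty} f_{\max}(n, m)\log_2 P_m}{(1/2)\log_2 P_m}
\end{aligned}$$

which implies

$$\limsup_{m \to \infty} \lim_{n \to \infty} f_{\max}(n, m) \le 0. \tag{38}$$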
Since $f_{\max}(n, m) \ge f_{\min}(n, m)$, (35) and (38) imply

$$\lim_{m \to \infty} \lim_{n \to \infty} f_{\max}(n, m) = \lim_{m \to \infty} \lim_{n \to \infty} f_{\min}(n, m) = 0 \tag{39}$$

and, as a result, for any $\xi > 0$, there exist $m_0(\xi)$ and $n_0(\xi, m)$ such that $|f_{\min}(n, m)| < \xi$ and
$|f_{\max}(n, m)| < \xi$ for all $m > m_0(\xi)$ and $n > n_0(\xi, m)$. Therefore, for any $\xi > 0$, $m > m_0(\xi)$ and
$n > n_0(\xi, m)$ we have

$$\left(\frac{1}{4} - \xi\right) n \log_2 P_m \le \log_2(\min\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}) \le \log_2(\max\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}) \le \left(\frac{1}{4} + \xi\right) n \log_2 P_m. \tag{40}$$

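We define $g(n, m) \ge 0$ such that $H(x_2^n + x_3^n) = (1 + g(n, m))\log_2(\max\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\})$. Then,
using (30), (31), (32), and (36) we have

$$\begin{aligned}
\frac{3}{2} &= \lim_{m \to \infty} \frac{\lim_{n \to \infty}(R_{1,m} + R_{2,m} + R_{3,m} - \delta_n)}{(1/2)\log_2 P_m} \\
&\le 1 + 1 - \frac{1}{2} - \limsup_{m \to \infty} \frac{\limsup_{n \to \infty} g(n, m)((1/4) + f_{\max}(n, m))\log_2 P_m}{(1/2)\log_2 P_m}
\end{aligned} \tag{41}$$

which, together with (39) and the condition $g(n, m) \ge 0$, imply

$$\lim_{m \to \infty} \limsup_{n \to \infty} g(n, m) = 0 \tag{42}$$

and, as a result, for any $\epsilon > 0$, there exist $m_0(\epsilon)$ and $n_0(\epsilon, m)$ such that $0 \le g(n, m) \le \epsilon$ for all
$m > m_0(\epsilon)$ and $n > n_0(\epsilon, m)$.

Therefore, for any $\epsilon > 0$, $m > m_0(\epsilon)$ and $n > n_0(\epsilon, m)$ we have

$$H(x_2^n + x_3^n) \le (1 + \epsilon)\log_2 \max\{|\mathcal{X}_{2,n,m}|, |\mathcal{X}_{3,n,m}|\}.$$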
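For any given $c > 1$, Lemma 10 guarantees the existence of $F_{c,n,m} \subseteq \mathcal{X}_{2,n,m} \times \mathcal{X}_{3,n,m}$ such that

$$|F_{c,n,m}| \ge \frac{|\mathcal{X}_{2,n,m}||\mathcal{X}_{3,n,m}|}{K} \quad \text{and} \quad |\mathcal{X}_{2,n,m} \overset{F_{c,n,m}}{+} \mathcal{X}_{3,n,m}| \le K'_{n,m}|\mathcal{X}_{2,n,m}|^{1/2}|\mathcal{X}_{3,n,m}|^{1/2}$$

with $K = c/(c - 1)$ and $K'_{n,m} = (\max\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\})^{1/2 + c\epsilon}(\min\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\})^{-1/2}$. Using
Theorem 3 with $F_{c,n,m}$, it follows that there exist $\mathcal{X}'_{2,n,m} \subseteq \mathcal{X}_{2,n,m}$ and $\mathcal{X}'_{3,n,m} \subseteq \mathcal{X}_{3,n,m}$ such that

$$|\mathcal{X}'_{2,n,m}| \ge \frac{|\mathcal{X}_{2,n,m}|}{4\sqrt{2}K} \tag{43}$$
$$|\mathcal{X}'_{3,n,m}| \ge \frac{|\mathcal{X}_{3,n,m}|}{4\sqrt{2}K} \tag{44}$$
$$|\mathcal{X}'_{2,n,m} + \mathcal{X}'_{3,n,m}| \le 2^{12} K^5 (K'_{n,m})^3 |\mathcal{X}_{2,n,m}|^{1/2}|\mathcal{X}_{3,n,m}|^{1/2}. \tag{45}$$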

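From (43) and (44) and Lemma 5 it follows that the set of codebooks $\{\mathcal{X}_{1,n,m}, \mathcal{X}'_{2,n,m}, \mathcal{X}'_{3,n,m}\}$
has rates $(R_{1,m}, R_{2,m} - \frac{1}{n}\log_2(4\sqrt{2}K), R_{3,m} - \frac{1}{n}\log_2(4\sqrt{2}K))$, and using the decoding functions
$\{g_{1,n,m}, g_{2,n,m}, g_{3,n,m}\}$ the average error probabilities are no larger than $\{P_{e,1,n,m} \cdot 32K^2, P_{e,2,n,m} \cdot
32K^2, P_{e,3,n,m} \cdot 32K^2\}$. Let $\tilde{x}_1^n, \tilde{x}_2^n, \tilde{x}_3^n$ be the random vectors induced by the codebooks $\{\mathcal{X}_{1,n,m},
\mathcal{X}'_{2,n,m}, \mathcal{X}'_{3,n,m}\}$, and let $\tilde{y}_i^n$ denote the corresponding channel outputs. Using Fano's inequality and
absorbing in $\delta'_n$ all the constants that vanish with $n \to \infty$ we write:

$$\begin{aligned}
n(R_{1,m} + R_{2,m} + R_{3,m} - \delta'_n) &\le \sum_{i=1}^{3} I(\tilde{x}_i^n; \tilde{y}_i^n) \\
&= \sum_{i=1}^{3} [H(\tilde{y}_i^n) - H(\tilde{y}_i^n | \tilde{x}_i^n)] \\
&= H(\tilde{x}_1^n + \tilde{x}_2^n + \tilde{x}_3^n) - H(\tilde{x}_2^n + \tilde{x}_3^n) + H(p\tilde{x}_2^n + q\tilde{x}_3^n) - H(q\tilde{x}_3^n) + H(\tilde{x}_3^n) \\
&= H(\tilde{x}_1^n + \tilde{x}_2^n + \tilde{x}_3^n) - H(\tilde{x}_2^n + \tilde{x}_3^n) + H(p\tilde{x}_2^n + q\tilde{x}_3^n).
\end{aligned} \tag{46}$$

To bound the first term of (46), as before, we use Lemma 12, obtaining

$$H(\tilde{x}_1^n + \tilde{x}_2^n + \tilde{x}_3^n) \le \frac{n}{2}\log_2\left[2\pi e\left(3P_m + \frac{1}{12}\right)\right]. \tag{47}$$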

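To bound the remaining two terms of (46) we use techniques from additive combinatorics to upper
bound the cardinality of the support set of $(p\tilde{x}_2^n + q\tilde{x}_3^n)$ in terms of the cardinality of the support
set of $(\tilde{x}_2^n + \tilde{x}_3^n)$.

From Lemma 7, (43), (44) and (45) we have

$$\begin{aligned}
|p \cdot \mathcal{X}'_{2,n,m} + q \cdot \mathcal{X}'_{3,n,m}| &\le |\mathcal{X}'_{2,n,m} + \mathcal{X}'_{3,n,m}|^{d(p,q)} |\mathcal{X}'_{2,n,m}|^{[1 - d(p,q)]/2} |\mathcal{X}'_{3,n,m}|^{[1 - d(p,q)]/2} \\
&\le \left[2^{12} K^5 (K'_{n,m})^3 |\mathcal{X}_{2,n,m}|^{1/2}|\mathcal{X}_{3,n,m}|^{1/2}\right]^{d(p,q)} \left(\frac{|\mathcal{X}_{2,n,m}|}{4\sqrt{2}K}\right)^{[1 - d(p,q)]/2} \left(\frac{|\mathcal{X}_{3,n,m}|}{4\sqrt{2}K}\right)^{[1 - d(p,q)]/2}
\end{aligned} \tag{48}$$

which together with the bound $H(X) \le \log_2 |\mathrm{supp}(X)|$ results in

$$\begin{aligned}
H(p\tilde{x}_2^n + q\tilde{x}_3^n) &\le \frac{1}{2}\log_2 |\mathcal{X}_{2,n,m}| + \frac{1}{2}\log_2 |\mathcal{X}_{3,n,m}| + 3d(p, q)\log_2 K'_{n,m} + K_{c,p,q} \\
&= \frac{1}{2}\log_2 |\mathcal{X}_{2,n,m}| + \frac{1}{2}\log_2 |\mathcal{X}_{3,n,m}| + \frac{3}{2}d(p, q)(1 + 2c\epsilon)\log_2(\max\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}) \\
&\quad - \frac{3}{2}d(p, q)\log_2(\min\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}) + K_{c,p,q}
\end{aligned} \tag{49}$$

where $K_{c,p,q}$ is some constant independent of $n$ and $m$.
On the other hand using (43) and (44) we have

$$\begin{aligned}
H(\tilde{x}_2^n + \tilde{x}_3^n) &\ge \max\{H(\tilde{x}_2^n); H(\tilde{x}_3^n)\} \\
&= \log_2(\max\{|\mathcal{X}'_{2,n,m}|; |\mathcal{X}'_{3,n,m}|\}) \\
&\ge \log_2(\max\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}) - \log_2(4\sqrt{2}K).
\end{aligned} \tag{50}$$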
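Therefore

$$\begin{aligned}
H(p\tilde{x}_2^n + q\tilde{x}_3^n) - H(\tilde{x}_2^n + \tilde{x}_3^n) &\le \frac{1}{2}\log_2(\min\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}) - \frac{1}{2}\log_2(\max\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}) \\
&\quad + \frac{3}{2}d(p, q)(1 + 2c\epsilon)\log_2(\max\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}) \\
&\quad - \frac{3}{2}d(p, q)\log_2(\min\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}) + K'_c \\
&\le \frac{3}{2}d(p, q)(1 + 2c\epsilon)\log_2(\max\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}) \\
&\quad - \frac{3}{2}d(p, q)\log_2(\min\{|\mathcal{X}_{2,n,m}|; |\mathcal{X}_{3,n,m}|\}) + K'_c
\end{aligned} \tag{51}$$

where $K'_c$ is some other constant independent of $n$ and $m$. Using the bounds in (40) we obtain:

$$H(p\tilde{x}_2^n + q\tilde{x}_3^n) - H(\tilde{x}_2^n + \tilde{x}_3^n) \le \left[3\xi d(p, q) + 3c\epsilon\, d(p, q)\left(\frac{1}{4} + \xi\right)\right] n \log_2 P_m + K'_c, \tag{52}$$

which, together with (46) and (47), imply

$$\frac{R_{1,m} + R_{2,m} + R_{3,m} - \delta'_n}{(1/2)\log_2 P_m} \le \frac{\frac{1}{2}\log_2\left[2\pi e\left(3P_m + \frac{1}{12}\right)\right] + \left[3\xi d(p, q) + 3c\epsilon\, d(p, q)\left(\frac{1}{4} + \xi\right)\right]\log_2 P_m + \frac{K'_c}{n}}{(1/2)\log_2 P_m} \tag{53}$$

which is strictly smaller than 3/2 for small enough $\xi$ and $\epsilon$, and large enough $m$ and $n$. This
contradicts (30).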



We can refine the above analysis to find the smallest $\epsilon(p, q)$ such that assuming that DoF(H) =
3/2 − $\epsilon(p, q)$ does not lead to a contradiction. The expression for $\epsilon(p, q)$ in the statement of the
lemma is obtained by avoiding bounds on $H(x_2^n + x_3^n)$ until the last step in the analysis. For
example, Lemma 10 is applied with

(fj(x_ n + x n ) \ 
— ff- — log 2 (max{|* 2 , n , m |; |* 3 ,„, m |}). 
log 2 (max{|-Y2,n,m|; \Xz,n,m\s) J 

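Note that $H(x_2^n + x_3^n) \ne H(\tilde{x}_2^n + \tilde{x}_3^n)$, where $\tilde{x}_2^n$ and $\tilde{x}_3^n$ are the vectors induced by the reduced
codebooks $\mathcal{X}'_{2,n,m}$ and $\mathcal{X}'_{3,n,m}$, and we still rely on the lower bound (50) for the latter entropy.
The entropy $H(x_2^n + x_3^n)$ and cardinalities in the final expression are bounded by assuming that
DoF(H) = 3/2 − $\epsilon(p, q)$, which can easily be shown, following steps similar to those leading to (40),
to imply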

ff (x5 + x 5 ) /I « 



n log 2 P \ 4 
and 

1 e(p,Q)\ , /-.n ^ log 2 (min{|^2,n,m|; |*3,n,m|}) . 

+ °U) - i 75 - 



4 2 / nlog 2 P 



log 2 (max{|A2 

,n,m I ) 1*3 ,n,m 

l» < (I I Ml ; „; |1 



n log 2 P \ 4 

The details are omitted. 
4.4 Proof of Theorem 2

By assumption all the entries of $H$ are non-zero and rational, and therefore, there exists a diagonal
matrix $D_r$ with positive diagonal entries such that $\tilde{H} = H D_r$ has non-zero integer entries. From
Lemma 1 it follows that

$$\mathrm{DoF}(H) = \mathrm{DoF}(\tilde{H}). \tag{54}$$

Consider the channel formed by the transmitters and receivers of users $i$, $j$ and $k$. Let $\tilde{H}_{i,j,k} \in \mathbb{Z}^{3 \times 3}$
be the principal minor (matrix) of $\tilde{H}$ corresponding to the $(i, j, k)$-th rows and columns. Due to
the independence of the signals of the different users it follows that

$$\max_{(R_1, \ldots, R_K) \in \mathcal{C}(\tilde{H}, \mathbf{1}, P\mathbf{1})} R_i + R_j + R_k = \max_{(R_i, R_j, R_k) \in \mathcal{C}(\tilde{H}_{i,j,k}, \mathbf{1}, P\mathbf{1})} R_i + R_j + R_k, \tag{55}$$

i.e. the other users cannot help users $i$, $j$, and $k$ to improve their rates. In addition, since interference
cannot help a given receiver in decoding the signal of interest it follows that we can set some of the
cross-gains in $\tilde{H}_{i,j,k}$ to zero without reducing the maximum achievable sum rate. More specifically,
defining $\hat{H}_{i,j,k} = [\hat{h}_{i,j,k}(m, n)]$ by $\hat{h}_{i,j,k}(m, n) = \tilde{h}_{i,j,k}(m, n) \cdot 1(m \ge n)$ we have

$$\max_{(R_i, R_j, R_k) \in \mathcal{C}(\tilde{H}_{i,j,k}, \mathbf{1}, P\mathbf{1})} R_i + R_j + R_k \le \max_{(R_i, R_j, R_k) \in \mathcal{C}(\hat{H}_{i,j,k}, \mathbf{1}, P\mathbf{1})} R_i + R_j + R_k. \tag{56}$$

Furthermore, it is easy to see$^8$ that there exist diagonal matrices $D_t$, $D_r$ with positive diagonal
entries such that $D_t \hat{H}_{i,j,k} D_r = \bar{H}_{i,j,k}$ where

$$\bar{H}_{i,j,k} = \begin{pmatrix} 1 & 0 & 0 \\ 1 & p_{i,j,k} & 0 \\ 1 & q_{i,j,k} & 1 \end{pmatrix}$$

for some $p_{i,j,k}, q_{i,j,k} \in \mathbb{Z}$, $p_{i,j,k}, q_{i,j,k} \ne 0$. Using (56), Lemma 1 and Lemma 11 we have

$$\mathrm{DoF}(\tilde{H}_{i,j,k}) \le \mathrm{DoF}(\hat{H}_{i,j,k}) = \mathrm{DoF}(\bar{H}_{i,j,k}) \le \frac{3}{2} - \epsilon(p_{i,j,k}, q_{i,j,k}) \tag{57}$$

where $\epsilon(p_{i,j,k}, q_{i,j,k}) > 0$.
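Considering every possible subset of users $\{i, j, k\} \subseteq \{1, \ldots, K\}$ and adding the corresponding sum
rates, the rate of each user appears $\binom{K-1}{2}$ times in the sum. Therefore, we have

$$\begin{aligned}
\binom{K-1}{2}\mathrm{DoF}(H) &\overset{(a)}{=} \binom{K-1}{2}\mathrm{DoF}(\tilde{H}) \\
&= \limsup_{P \to \infty} \frac{\max_{(R_1, \ldots, R_K) \in \mathcal{C}(\tilde{H}, \mathbf{1}, P\mathbf{1})} \binom{K-1}{2}\sum_{i=1}^{K} R_i}{\frac{1}{2}\log_2 P} \\
&\le \limsup_{P \to \infty} \frac{\sum_{\{i,j,k\} \subseteq \{1, \ldots, K\}} \max_{(R_1, \ldots, R_K) \in \mathcal{C}(\tilde{H}, \mathbf{1}, P\mathbf{1})} (R_i + R_j + R_k)}{\frac{1}{2}\log_2 P} \\
&\overset{(b)}{=} \limsup_{P \to \infty} \frac{\sum_{\{i,j,k\} \subseteq \{1, \ldots, K\}} \max_{(R_i, R_j, R_k) \in \mathcal{C}(\tilde{H}_{i,j,k}, \mathbf{1}, P\mathbf{1})} (R_i + R_j + R_k)}{\frac{1}{2}\log_2 P} \\
&\le \sum_{\{i,j,k\} \subseteq \{1, \ldots, K\}} \mathrm{DoF}(\tilde{H}_{i,j,k}) \\
&\overset{(c)}{\le} \binom{K}{3}\left(\frac{3}{2} - \delta\right),
\end{aligned} \tag{58}$$

where (a) is due to (54), (b) follows from (55), (c) is obtained from (57), and where we defined
$\delta = \min_{\{i,j,k\} \subseteq \{1, \ldots, K\}} \epsilon(p_{i,j,k}, q_{i,j,k}) > 0$. We finally obtain

$$\mathrm{DoF}(H) \le \frac{K}{2} - \frac{K}{3}\delta < \frac{K}{2}, \tag{59}$$

establishing the theorem.

8 Letting $\hat{H}_{i,j,k}$ = [a, 0, 0; b, c, 0; d, e, f] in Matlab matrix notation, we can choose $D_t$ = [bd, 0, 0; 0, ad, 0; 0, 0, ab] and
$D_r$ = [1/(abd), 0, 0; 0, 1/a, 0; 0, 0, 1/(abf)].

Figure 3: Three-user GIFC of the example in Section 5.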
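The diagonal scaling of footnote 8 and the counting step behind (59) can both be verified with exact rational arithmetic. In the sketch below the example matrix and the function name `normalize` are hypothetical; only the formulas for the two diagonal matrices are taken from the footnote.

```python
from fractions import Fraction
from math import comb


def normalize(H):
    """Scale lower-triangular H = [[a,0,0],[b,c,0],[d,e,f]] by the D_t, D_r of footnote 8."""
    (a, _, _), (b, c, _), (d, e, f) = H
    Dt = [b * d, a * d, a * b]                                             # diagonal of D_t
    Dr = [Fraction(1, a * b * d), Fraction(1, a), Fraction(1, a * b * f)]  # diagonal of D_r
    # Entry (m, n) of D_t H D_r is Dt[m] * H[m][n] * Dr[n].
    return [[Dt[m] * H[m][n] * Dr[n] for n in range(3)] for m in range(3)]


H_hat = [[2, 0, 0], [3, 5, 0], [7, 4, 6]]   # arbitrary integer example
H_bar = normalize(H_hat)

# Each user belongs to C(K-1, 2) of the C(K, 3) triples, which gives the factor K/3 in (59).
ratios_match = all(3 * comb(K, 3) == K * comb(K - 1, 2) for K in range(3, 20))
```

With this scaling the two remaining free entries are p = cd and q = be, which are non-zero integers whenever the original entries are.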



5 A 3-user rational GIFC example 



In this section, we will derive lower and upper bounds on the degrees-of-freedom of the 3-user GIFC
with channel matrix

$$H = \begin{pmatrix} 1 & 0 & 0 \\ 1 & 2 & 0 \\ 1 & 1 & 1 \end{pmatrix}$$

represented in Figure 3. This channel is a special case of the one considered in Lemma 11 with
$p = 2$ and $q = 1$. From this lemma, we obtain:

$$\mathrm{DoF}(H) \le \frac{3}{2} - \frac{1}{12\,d(2, 1) + 2} \tag{60}$$

where $d(p, q) = 2\max\{|p|, |q|\} + 5$ was obtained in Lemma 7. For the special case of $q = 1$ the result
of Lemma 7 can be easily strengthened to get $d(p, 1) = 2|p| + 3$. Evaluating (60) with $d(2, 1) = 7$ we
obtain DoF(H) ≤ 1.4884. It should be possible to improve this bound in a number of ways, such as
by improving on $d(2, 1) = 7$, and possibly by improving on the power 3 in the term $(K')^3$ appearing
in Theorem 3. Another possibility might be to forgo this theorem for a different approach, such as
one based on Exercise 2.5.4 of [20].
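The numerical value quoted above follows directly from the formula of Lemma 11. A minimal sketch (the function name `dof_upper_bound` is hypothetical) compares the general exponent d(2, 1) = 9 from Lemma 7 with the strengthened value d(2, 1) = 7:

```python
def dof_upper_bound(d):
    """Upper bound 3/2 - 1/(12 d + 2) on DoF(H) from Lemma 11, as a function of d(p, q)."""
    return 1.5 - 1.0 / (12 * d + 2)


general = dof_upper_bound(2 * max(abs(2), abs(1)) + 5)   # d(2, 1) = 9 from Lemma 7
improved = dof_upper_bound(7)                            # strengthened d(2, 1) = 7
```

The gain from the smaller exponent is modest (roughly 1.4909 versus 1.4884), which illustrates why the text suggests that larger improvements would need a different treatment of the constants.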
To get a lower bound on DoF(H), we will describe a communication scheme that aims to achieve
good interference alignment at receiver 1 by aligning the interfering signals of transmitters 2 and 3,
while achieving good separation at receiver 2 between the signal of transmitter 2 and the interference
of transmitter 3.

As was done in [10], we design the communication scheme for a deterministic interference channel
and later show how to extend the scheme to the Gaussian channel. We derive the deterministic
channel from the Gaussian IFC by removing the Gaussian noise and constraining the inputs to be
integers. Let $A_1 = \{0, 1\}$, $A_2 = \{0, 2, 4\}$, $A_3 = \{0, 2\}$, and $Q = 8$. We communicate information
independently over $L$ levels, without coding over time. The signal of user $i$ at time $t$ is given by:

$$x_i(t) = \sum_{\ell=1}^{L} m_{i,\ell}(t) Q^{\ell - 1}, \tag{61}$$

where $m_{i,\ell}(t) \in A_i$ is the message of user $i$ in level $\ell$ at time $t$.

Since $A_1 + A_2 + A_3 = \{0, 1, 2, 3, 4, 5, 6, 7\}$, the signal at receiver 1 can be written as

$$y_1(t) = \sum_{i=1}^{3} x_i(t) = \sum_{\ell=1}^{L} w_{1,\ell}(t) Q^{\ell - 1}$$

with $w_{1,\ell}(t) = m_{1,\ell}(t) + m_{2,\ell}(t) + m_{3,\ell}(t)$. Therefore, by computing the Q-ary decomposition of
$y_1(t)$ we can recover the sums $m_{1,\ell}(t) + m_{2,\ell}(t) + m_{3,\ell}(t)$ at each of the $L$ levels. In addition, since
$A_2 + A_3 = \{0, 2, 4, 6\}$ we have $m_{1,\ell}(t) = 1(w_{1,\ell}(t) \in \{1, 3, 5, 7\})$, so we can directly determine $m_{1,\ell}(t)$
from $w_{1,\ell}(t)$.

Similarly, at receiver 2 we compute

$$\frac{y_2(t)}{2} = x_2(t) + \frac{1}{2}x_3(t) = \sum_{\ell=1}^{L} w_{2,\ell}(t) Q^{\ell - 1}$$

with $w_{2,\ell}(t) = m_{2,\ell}(t) + (1/2)m_{3,\ell}(t) \in \{0, 1, 2, 3, 4, 5\}$, from which we can compute $m_{2,\ell}(t) =
w_{2,\ell}(t) - [w_{2,\ell}(t) \bmod 2]$.

Finally, receiver 3 can directly recover $m_{3,\ell}(t)$ at all levels from the received signal $y_3(t) = x_3(t)$.

To compute the achievable degrees-of-freedom of this scheme we note that since $|x_i(t)| < Q^L$ the
transmission power at each transmitter is smaller than $Q^{2L}$. On the other hand the rate of users
1 and 3 is $L \log_2 2$ while the rate of user 2 is $L \log_2 3$. Therefore, we obtain for the deterministic
channel

$$\mathrm{DoF} \ge \frac{2L \log_2 2 + L \log_2 3}{L \log_2 8} = \frac{2 + \log_2 3}{3} \approx 1.19499. \tag{62}$$
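The deterministic multi-level scheme described above is simple enough to simulate directly. The following sketch draws random messages from the three alphabets given in the text, passes them through the noiseless integer channel with p = 2 and q = 1, and applies the digit-wise decoding rules; the number of levels L = 5 and all function names are assumptions made for illustration.

```python
import math
import random

Q, L = 8, 5
ALPHABETS = [(0, 1), (0, 2, 4), (0, 2)]        # A_1, A_2, A_3 from the text


def encode(digits):
    """x_i = sum over levels of m_{i,l} Q^(l-1), with list index 0 as level 1."""
    return sum(m * Q ** level for level, m in enumerate(digits))


def q_ary(value):
    """The L lowest Q-ary digits of a non-negative integer, lowest level first."""
    return [(value // Q ** level) % Q for level in range(L)]


def simulate(rng):
    """Send one random symbol per user and report whether all levels decode correctly."""
    msgs = [[rng.choice(A) for _ in range(L)] for A in ALPHABETS]
    x1, x2, x3 = (encode(m) for m in msgs)
    y1, y2, y3 = x1 + x2 + x3, 2 * x2 + x3, x3     # deterministic channel, p = 2, q = 1
    m1_hat = [w % 2 for w in q_ary(y1)]            # an odd digit means m_1 = 1
    m2_hat = [w - w % 2 for w in q_ary(y2 // 2)]   # strip the m_3 / 2 contribution
    m3_hat = q_ary(y3)
    return msgs == [m1_hat, m2_hat, m3_hat]


dof = (2 + math.log2(3)) / 3
```

Decoding never fails here because every per-level sum stays below Q = 8, so no carries propagate between levels; this is the "no carry overs" property that makes the scheme easy to analyze.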

We now informally argue that the same degrees-of-freedom can be achieved in the Gaussian channel.
We essentially use the same multi-level coding scheme, but we now encode the signals of each level
over long blocks of time. The lower levels may be severely affected by noise, but as the level $\ell$
increases, the influence of the noise becomes smaller, ultimately being insignificant. As a result,
the amount of redundancy that needs to be added to the signal of level $\ell$ to ensure low probability
of decoding error goes to 0 as $\ell$ grows to infinity. It follows that for $\ell$ large enough, the achievable
rates in the Gaussian channel at level $\ell$ approach the achievable rates in the deterministic channel,
and since the rates of the lower levels do not affect the degrees-of-freedom, we conclude that
$\mathrm{DoF}(H) \ge \frac{2 + \log_2 3}{3}$ (see [10] for a similar argument).

In summary, using the lower and upper bounds that we derived we have, 

1.19499 < DoF(H) < 1.4884. 

Remark 2 The achievable scheme that we described is simple to analyze because there are no 
"carry overs" across the different levels, and the signals and interference are "orthogonal" in the 
sense that there is no need to code over time to ensure reliable decoding in the deterministic channel. 
In choosing the sets $A_1$, $A_2$ and $A_3$ we tried to obtain small $|A_2 + A_3|$ to align the interference at
receiver 1 and simultaneously obtain large $|2 \cdot A_2 + A_3|$ to achieve good signal-interference separation
at receiver 2. With these design guidelines, one could optimize the sets $A_1$, $A_2$ and $A_3$ (with possibly
larger $Q$) in order to improve the achievable degrees-of-freedom. In addition, Han-Kobayashi-type
schemes f^j at each level where part of the interference is decoded and subtracted may result in 
better performance than purely orthogonal schemes. 

6 Conclusion 

We have shown that the degrees-of-freedom of K > 2 user, real, scalar GIFCs is sensitive to whether
the channel gains have rational or irrational values, and it is, in fact, discontinuous at all fully
connected, rational gain matrices (up to the invariance property of Lemma 1). Specifically, Theorem 1
shows that certain fully connected real, scalar GIFCs with irrational, algebraic coefficients have
degrees-of-freedom exactly equal to the known upper bound of K/2, the first such examples for
real, scalar GIFCs. Theorem 2, on the other hand, shows that if all coefficients are non-zero
rationals, the degrees-of-freedom is strictly bounded away from K/2, for K > 2. These theorems are
established by appealing to major results in mathematics on the inapproximability of irrational,
algebraic numbers by rational numbers, in the case of Theorem 1, and on the combinatorics of
additive sets, in the case of Theorem 2. In the latter case, previously used information theoretic
converse techniques, which are not sensitive to the rationality of channel coefficients, do not suffice.
We believe these results may have some implications for real GIFCs under channel parameter
uncertainty, since in this case, channels with irrational and rational coefficients would have
to be dealt with simultaneously. Additionally, in practical systems, computations for encoding and
decoding are ultimately restricted to finite precision, and hence rational numbers, suggesting that
additive combinatorics based bounds on achievable rates may have practical relevance.

Throughout this paper, we have been concerned with real, scalar GIFCs. Theorem 1 can be
readily extended to the complex and vector cases, revealing an additional class of K/2 degrees-of-freedom
vector GIFCs, complementing those already known [9]. The extension of Theorem 2 to
complex and vector GIFCs seems less trivial. For instance, the example of a K/2 degrees-of-freedom
achieving K-user two-dimensional vector GIFC in [9] actually has integer coefficients, though they
are a very special choice. Any extension of Theorem 2 would have to avoid such special cases.
More significantly, our crucial Lemma 11 can be shown, using the interference alignment technique
of [9], to not hold in the complex or vector cases. Nevertheless, we conjecture that for any fixed
(complex) vector dimension, limitations on the degrees-of-freedom similar to Theorem 2 do exist
for a sufficiently large number of users K. As noted, establishing such a result would require
analyzing few-user GIFCs that are more complicated than the 3-user channel of Lemma 11. It is
likely that tools from additive combinatorics will still prove useful, though they would need to be
applied differently from the proof of Lemma 11. The topic of characterizing the degrees-of-freedom
of rational vector GIFCs and scalar, complex GIFCs having channel gains with rational real and
imaginary parts is left for future work.